. .INSTITUTE of . 

Museum.odLibrary 

SERVICES 

IMLS INTERIM PERFORMANCE REPORT FORM 

For Projects with Award Dates between October 1, 2015 and September 30, 2018 
(i.e., award number ends in -16, -17, or -18) 

Please consult the IMLS Interim Performance Report Instructions when filling out this form. 


Federal agency and organization element to 
which report is submitted: 

Institute of Museum and 
Library Services 


2. Federal award or other identifying 
number assigned by federal agency: 

RE-85-17-0060-17 


4. Recipient organization (name and complete address, including ZIP+4/postal code): 

Internet Archive 

300 Funston Avenue 

San Francisco, CA 94118-2216 


6a. Award period of performance start 
date (MM/DD/YYYY); 

06/01/2017 


6b. Award period of performance end 
date (MM/DD/YYYY): 

08/30/2020 


Page 1 


of 23 
Pages 


3a. D-U-N-S* number: 
059453410 


3b. EIN/TIN: 94-32422767 


5. Recipient identifying or 
account number: 

059453410 


7. Reporting period end date 

(MM/DD/YYYY): 

05/31/2019 


8. Project URLs, if any: 
httDs://communitywebs.archive-it.org/ 

9. Report frequency: 

X annual 

□ semi-annual 

□ final 

□ other 

If other, describe: 

10. Other attachments? □ Yes X No 

Contact the appropriate IMLS program office to receive instructions for transmitting additional attachments. 

11a. Name and title of Project Director: 

lib. Telephone (area code, number, extension): 

1-415-561-6767 

Jefferson Bailey, Director of Web Archiving & Data Services 

11c. Email address: 

jefferson@archive.org 

12. Certification: By submitting this report 1 certify to the best of my knowledge and belief that this information is correct 
and complete for performance of activities for the purposes set forth in the award documents. 

13a. Signature of Authorized Certifying Official: 

13b. Date report submitted (MM/DD/YYYY): 

08/29/2019 


13c. Name and title of Authorized Certifying Official: 


Jacques Cressaty, Director of Finance 


13d. Telephone (area code, number, 
extension): 

1-415-561-6767 


13e. Email address: 
jacques@archive.org 


Burden Estimate and Request for Public Comments: Send comments regarding this burden estimate or any other aspect of this collection of information, including 
suggestions for reducing this burden, to the Institute of Museum and Library Services, 955 L'Enfant Plaza North, SW, Washington, DC 20024-2135. 

1 

IMLS-CLR-F-0023 


0MB No. 3137-0071, EXPIRES 01/31/2019 






























The purpose of the Interim Performance Report is to provide a record of grant-funded project activities at annual 
intervals throughout the grant period. If you have questions concerning the interim performance reporting 
requirements, you may address them to the Program Officer assigned to your grant and whose name and contact 
information appears in your Official Award Notification. IMLS may share Interim Performance Reports with grantees, 
potential grantees, and the general public to further the mission of the agency and the development of museum and 
library services. Reports may be distributed in a number of ways and formats, including online. 

14. Recipient Organization: Internet Archive 

15. Project Title: Community Webs: Empowering Public Librarians to Create Community History Web Archives 

16. Project Summary: Community Webs is a two-year project to provide continuing education, training, and 
technology services and infrastructure to enable public librarians to build collections of historically-valuable, web- 
published materials documenting their local communities. Based on a cohort model and including in-person and virtual 
trainings, applied education, iteratively-developed open educational resources, and technology for creating extensive 
born-digital web archives, the project will empower public libraries to serve as leaders in archiving and providing access 
to the web-published records chronicling the lives and accomplishments of their patrons. 


17. Activities 


Activities Proposed in Your 
Application 

Activities Completed 
during the Award Period of Performance 

Explanation of 
Any Variance 

Year 1: Design application 
materials for cohort selection 

The following materials were developed for the application 
process: 



• Web page with project information 

• Blog post 

• Press Release 

• Online Application Form 



Non-web published documents are available as appendices in this 
report. The Press Release is included as Appendix 1 and the Online 
Application Form as Appendix 2. 


Year 1: Develop online course 
space 

Working with project partner WebJunction, an online course space 
was developed to host all program materials and to serve as a 
space for cohort members to communicate. WebJunction staff 
created the space using their learning management platform (a 
Moodle-based system) and created sections for each of the 
planned five curriculum modules and two in-person meetings. 
Internet Archive staff were given administrative access to the 
online course space and added extensive additional learning 
materials to assist cohort members in maximizing their online 
learning experience. Types of resources added include reading 
lists, bibliographies, case studies, instructional policy and workflow 
documents, and video tutorials on using specific software tools. 

The course space also included forums for cohort members and 
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program staff to share information about upcoming conferences, 
educational opportunities, and wikis and forums to reflect on 
curriculum topics and to share press, outreach materials, and 
conference presentations. 

The original course space is restricted to the members of the 
cohort. However, adapted versions of training materials are freely 
available via the program website at 

https://communitvwebs.archive-it.org. This material has also been 


adapted into a public course by WebJunction at 
https://learn.webjunction.org/course/view.php?id=410 that is 
available to any WebJunction user. 

Year 1: Recruit and select cohort 

Through strong promotional help from dozens of groups and lists, 
including PLA, ULC, SAA, ARSL, SHN, DPLA, and others, and direct 
conference outreach, we had an amazing response to our call for 
applications. For the 15 available spots we had 111 applications, 
allowing an acceptance rate of a mere 9%. The applicant pool 
represented public libraries from 35 states and included almost all 
the major metropolitan public library systems. Interest in the 
program far exceeded our expectations and is a testament to the 
critical need the program serves as well as the promotional efforts 
of program staff and the larger community of collaborators. 

Given the demonstrated strong interest in the program, we were 
able to secure additional funding from the Kahle-Austin 

Foundation and subsidize and cost-share additional infrastructure 
resources and professional development funding through Archive- 
It that and allowed us to add an additional 12 cohort member 
libraries to the overall Community Web program. 

Selecting the cohort members was challenging given the large 
number of qualified, excellent applications. Our selection 
considerations included: the diversity of the population served, 
geographical location (we wanted a good mix of rural and urban 
libraries), prior experience with local history collecting and public 
programing, and documented institutional support for program 
participation. With this in mind, the selection committee - 
composed of program staff and partners, as well as Advisory 

Board members - narrowed the candidates down to a top pool 
and then voted on the final members. 

The final cohort membership list is available on the program 
website at https://communitvwebs.archive- 
it.org/participants.html, and in various announcements. The 
Community Webs participating public libraries represent 17 states 
from all regions of the US and serve a diverse constituency of 
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communities both urban and rural in population density and 
profile. The largest library system represented in the cohort is the 
New York Public Library (represented by its Schomburg Center for 
Research in Black Culture), while the smallest comes from the 
Southern Arizona town of Patagonia, with a population of 900. 

The blog post announcing the participating libraries and launch of 
the program can be viewed here: 

http://blog.archive.org/2018/02/28/27-public-libraries-and-the- 



internet-archive-launch-community-webs-for-local-history-web- 



archiving/ 


Year 1: Develop course outlines 
and related materials 

The Community Webs course for cohort members was organized 
into five modules that introduce the concept of web archiving, and 
discuss topics such as collection development, community 
engagement, and marketing and dissemination of local programs 
and their outcomes. The syllabus was designed to be a working 
document meant to act as a general outline. Specific topics 
covered in each module were determined based on interest and 
need of the participants. Course materials have been developed 
by curriculum consultant, Diantha Schull, and the Internet Archive 
Community Webs program staff. 


Years 1 & 2: Participation in 
online forum 

We kicked off the program with cohort members introducing 
themselves via the private online forum located in the 

Webjunction course space. Additionally, each training module 
included at least one forum discussion question. Some example 
discussions included collection development, metadata, advocacy 
and working with the public. 



The forum then served as a platform for knowledge sharing on 
topics covering the spectrum of program activities, from web 
archiving technical specifics to policy formation and public 
programming ideas. In total there were 38 discussion forums with 
130 forum posts by program participants. 


Year 1: Planning and conducting 
in-person kickoff cohort 
meeting and immersive training 

In November 2017, the cohort gathered together at the Internet 
Archive for a kickoff meeting of brainstorming, socializing, and 
discussion of all things web archiving. The day started with a 
welcome and introduction by the Internet Archive founder, 

Brewster Kahle, followed by a program overview by the Project 
Director, Jefferson Bailey. Program staff Maria Praetzellis and 
Diantha Schull presented on current trends in community 
engagement within public libraries and web archiving specifically. 
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Cohort members from the "library leads" participants, San 
Francisco, Cleveland and Queens Public Libraries, shared details on 
their existing local history programs and ideas for collection 
development around web materials. These presentations included 
discussions on building collections documenting their 
demographic diversity or focusing on local issues, such as housing 
availability or changes in community profile. As an example, Abbie 
Zeltzer from the Patagonia Public Library, spoke about the changes 
in her community of 913 residents as the town redevelops a long 
dormant mining industry. Zeltzer expressed her intent to develop 
a web archive documenting this transition and the related 
community reaction and changes. 

Individual slide decks for the meeting are all openly accessible in 
the Community Webs Collection at the Internet Archive and are 
linked below. 

https://archive.org/details/EvolvingPracticesPraetzellis 

https://archive.org/details/CommunityWebsDianthaSchull 

https://archive.org/details/bailey comm webs kickoff 2017 

https://archive.org/details/CommunityWebsNataliel\/lilbrodt 

The program agenda is included as Appendix 3 of this report. 

In addition to presentations and discussions, "Office Hours" were 
held during the meeting allowing time for cohort members to 
work individually with a member of the Archive-lt team regarding 
any technical questions about web archiving. 

A feedback survey was conducted following the meeting to assess 
the value of the meeting and generate ideas for future sessions. 

All respondents rated the meeting as either Excellent (75%) or 
Good (25%). Survey responses are noted below: 

"What did you like most about the meeting?" 

• Getting a firmer understanding of the scope of the work 
we'll be doing and how we can work together as a cohort. 

This is a great group of people. 

• Openly sharing experiences, challenges and ideas (the 
facilitated discussions and conversations throughout the 
day). 

• Networking and meeting the folks from lA. Hearing about 
other institutions hurdles and successes. 

• Getting to meet and brainstorm with other PUBLIC 
LIBRARIANS! 

• Interacting with the archivists/librarians from around the 
country. Seeing the lA headquarters and learning more 
about its mission and projects. 

• The one-on-one tutoring session was really helpful. 
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• 1 liked meeting the staff and other members of the 
cohort.However, it was a great deal to absorb in one day. 

• Lunch was really interesting! 1 liked meeting members of 
my cohort and learning from experienced Archive It users. 

1 also really enjoyed learning more about what Internet 
Archive is up to! 

What would you like us to do differently for next year's meeting? 

• 1 wouldn't mind having the day split into two. 1 felt very 
tired by the end of the day and a bit overwhelmed by all 
the info. 1 think it might be nice to do 1.5 days instead of 

1. Maybe we'll be seasoned pros by this time next year 
though, and 1 day will feel sufficient. 

• 1 thought it was pretty great, to be honest! 

• Additional time to meet with active web archivists to 
discuss strategies and have a bit of hands on training. 

• Next year, we will have conducted some public programs 
and done some outreach. It would be good to have some 
reflection and problem solving exercises based on these 
experiences. 

• Presentation of focused content. Brainstorming is nice, 
but it takes up a lot of time that could be used to "force- 
feed" information. 

• 1 would make it 2 days. For some of us, it was quite a 
distance to travel and it would have been beneficial to run 
a crawl and since it takes 24 hours for it show in Wayback 
Machine, we would be able to discuss the results with 
someone in person. It would also be helpful to have more 
hands on work with the Archive It application, to work on 
scoping and reviewing crawls, etc. 


Year 1: Virtual training and 
module one 

The first virtual training introduced the Community Webs 
program, as well as information on what was expected of cohort 
members throughout the program and what they could expect 
from facilitators and the online course space. A high level 
overview was given along with case studies from Cleveland Public 
Library and East Baton Rouge Parish Library. The session 
concluded with a tour through the online course space. 

Materials developed for training module one have been modified 
and made available to the public on the project website under 
"Web Archiving 101 Curriculum": https://communitywebs.archive- 
it.org/web-archive.html 

A recording of Cleveland Public Library's case study can be viewed 
here: https://archive.org/details/MlCPLv2 

A recording of East Baton Rouge's case study can be viewed here: 
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https://archive.org/details/IVIlEBRPL.v2 


Year 1: Virtual training and 
module two 

The second virtual training explored the issue of collection 
development for web archives specific to public libraries. Case 
studies from the Community Webs cohort members, The 

Schomburg Center for Research in Black Culture, Queens Public 
Library, and Henderson District Public Libraries provided a 
practical look into how libraries are currently handling the 
question of local collection development. Additionally, Internet 
Archive staff presented on their successful model of collaborative 
collection development and demonstrated the tools and methods 
used to allow efficient and scalable public participation in 
crowdsourced projects. The training also explored the intersection 
of local history collecting and the web as a historical resource and 
examined issues in ensuring an equitable archive representing the 
experiences of local citizens. 

Materials developed for training module two have been modified 
and made available to the public on the project website under 
"Collection Development Curriculum": 
https://communitywebs.archive-it.org/collection- 


development.html. 

Recordings from this training include: 

• https://archive.org/details/M2DSCDv2 - Diantha Schull's 
introduction to web archive collection development 

• https://archive.org/details/M2QPL - Web Archive 

Collection Development Policy at Queens Public Library 

• https://archive.org/details/M2HLv2 - Web Archiving at 
Henderson District Public Libraries 

• https://archive.org/details/M2NYPLSCv2 - Web Archiving 
at the Schomburg Center for Research in Black Culture 

• https://archive.org/details/M2CrowdsourceStrategiesv2 - 

Crowd Sourcing Strategies for Web Archive Collection 
Development 

Year 1: Virtual training and 
module three - April 2018 

The third virtual training covered community engagement in local 
history web archive collection building. The Community Webs 
program consultant focusing on curriculum development, Diantha 
Schull, shared her insight into trends and best practices for public 
programming and outreach, and tied her lessons and guidance to 
her recent book, Archives Alive: Expanding Engagement with 

Public Library Archives and Special Collections. Guest speakers 
from the Austin History Center's Community Archivist Program 
talked about their experience working and building trust with local 
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communities, and cohort member Kansas City Public Libraries 
provided a detailed look at what they are doing within the 
community to engage their patrons in their web archive collection 
building project. 

Materials developed for training module three have been 
modified and made available to the public on the project website 
under "Community Engagement": 
https://communitywebs.archive-it.org/community- 


engagement.html 

Recordings from this training include: 

• https://archive.org/details/M3DSCEv2 - Community 
Engagement with Digital History Collections 

• https://archive.org/details/M3CAPAHCv2 - Community 
Archivist Program at the Austin History Center 

• https://archive.org/details/M3KCPLv2 - Web Archiving at 
the Kansas City Public Library 

Year 2: Virtual Training Module 
four - September 2018 

The fourth virtual training covered the topic of sustainability. 
Katherine Skinner of the Educopia Institute joined the webinar to 
discuss her experience working with community content creators, 
sustainability, and strategic planning. Katherine also outlined 
Educopia's recently released Community Cultivation Model which 
is designed to help community leaders grow and sustain healthy 
communities and provide timely advice to the cohort around the 
topic of long term planning and sustainability of the web archiving 
programs. 

Materials developed for training module four have been modified 
and made available via the Community Webs website under 
"Sustainability" https://communitywebs.archive- 
it.org/sustainability.html 

Recordings from this training include: 

• https://archive.org/details/M4Educopia - Educopia 
Institute's Community Cultivation Model, Katherine 

Skinner 


Year 2: Virtual Training Module 
five - February 2019 

The final training module covered the use of web archives to 
support research. Jefferson Bailey, Director of Web Archiving for 
the Internet Archive, kicked off the meeting with an overview of 
research initiatives utilizing web archives. Ian Milligan, Associate 
Professor of History at the University of Waterloo, detailed his 
innovative project, Archives Unleashed, which among other things, 
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is developing "web archive search and data analysis tools to 
enable scholars, librarians and archivists to access, share, and 
investigate recent history since the early days of the World Wide 
Web." Dr. Milligan shared case studies of how today's researchers 
are utilizing web archives and the role that libraries and archives 
have in providing access to this content. 

Materials developed for training module five hav= been modified 
and made available to the public on the project website under 
"Access and Research in Web Archives" 
https://communitywebs.archive-it.org/access research.html 


Recordings from this training include: 

• https://archive.org/details/M5JeffersonBailey - Overview 
of research initiatives utilizing web archives, Jefferson 

Bailey 

• https://archive.org/details/M5lanMilligan - Accessing web 
archives at scale, Ian Milligan 

Year 2: Planning and conducting 
the second in-person meeting - 
November 2018 

Planning for the 2nd in-person meeting included a survey sent to 
cohort members with detailed questions about their experiences 
participating in the program up to that date. These questions 
allowed program administrators to ensure the meeting addressed 
specific concerns and hurdles emerging over the prior year of 
work. Questions included: 

• Please describe your library's approach to collection 
development and policy relating to web archiving. 

• Was your library involved in community outreach 
activities/public programming? If so, please describe the 
impact this had on your web archiving program. 

• Have you (or do you plan to) provide access integration for 
your web archive collections with other library systems? 

For example, incorporating web archive data into your 
library catalog. 

• What aspects of your web archiving program have proven 
successful thus far? 

• What aspects of your program have been the most 
challenging or required re-thinking along the way? 

• What ideas do you have for the future development of 
collecting web based content within the public library? 

• What ways can the cohort support the larger public library 
community to extend the program and encourage other 
libraries to engage in this type of collecting? 

• What would your recommendations be for a public library 
embarking on a new web archiving program? 

• What topics do you think would be most useful to cover in 
the white paper? 
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Responses to this survey were used to help inform the meeting 
program and Year 2 activities and can be found as Appendix 5. 

The second in-person meeting for all cohort members was held in 
Columbus, OH at the Columbus Metropolitan Library. A blog post 
detailing the day is available here: https://archive- 
it.org/blog/post/communing-in-columbus/ . 

This meeting started with presentations by Internet Archive staff 
and included a presentation by Aaron O'Donovan from Columbus 
Public library about their collection development projects and a 
tour of the library. The remainder of the time was spent in 
facilitated discussion groups on the following topics: 

• Program sustainability 

• Technology and tools 

• Collection development and policy 

• Programming and outreach 

A complete schedule of the meeting and discussion group sessions 
can be found as Appendix 6. 

"Office Hours" were also held during the meeting in which cohort 
members could sign up for 15 minute sessions with a member of 
the Archive-lt team to walk through any technical questions about 
web archiving. 

The meeting also featured a round-table session devoted to 
exploring ongoing program improvements beyond the scope of 
the specific grant timeline as well as discussion on matters of 
sustainability, both locally as well as for the program as a whole. 

While participants have program-subsidized ability to continue to 
build and manage collections via Archive-lt beyond the specific 
grant timeline, the end of the grant does mark the end of their 
professional development and public programming grant funding. 

However, a vast majority cohort members expressed an interest in 
continuing to participate in non-collecting based program 
activities beyond the grant end date. Ideas for ongoing 
participation and promotion included: 

• A co-authored journal publication documenting the 
experiences of a select number of participants who were 
new to web archiving and born-digital collection building. 

This paper is underway and due to be completed in Q4 
2019. 

• A co-authored journal publication consisting of 
quantitative analysis of public library web archive 
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collection development in comparison with the collection 
development activities of other types of institutions. This 
paper is also currently underway with multiple cohort 
members and program staff contributing. 

• Continued use of the Community Webs forums and virtual 
communication spaces for arranging joint conference 
presentation submissions. 

• Use of Community Webs communication tools to arrange 
informal meet-ups of cohort members and program staff 
at various conferences and professional events. 

• Participation of cohort members in post-grant additional 
work on program evaluation and meeting on sustainability 
and expansion of the program. 

Overall, the Year 2 in-person meeting was a success in allowing 
program participants the chance to assess their own development 
in learning new professional skills, building and expanding their 
professional networks, engaging in the professional community via 
conference presentations and group trainings, developing new 
technical skills and knowledge of born-digital activities and online 
technologies, engaging in activities such as policy development 
and workflow engineering, and, finally, exploring new methods of 
community engagement, local history documentation, and digital 
preservation and access. 


Year 1: Developing the 
Community Webs Website 

The Community Webs website, at 
https://communitywebs.archive-it.org/, was built with 


information about the program, links to cohort member 
collections, and links to news items (blog posts, conference 
presentations, etc.). 

The website also includes a high level curriculum that mirror the 
training modules. Curriculum sections include videos from online 
meetings and links to reading materials on given subjects. The 
curriculum sections are: 

• Web Archiving 101 

• Collection Development 

• Community Engagement 

• Sustainability 

• Access and Research in Web Archives 

The website will continue to be developed and updated beyond 
the specific timeline of the grant funding. 

Years 1& 2: Local history web 

The cohort has developed a combined total of 703 collections, 
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archive collection building 


including collections developed by Library Lead partners prior to 
the program starting. In year 1 of the program members created 
148 new collections and in year 2 they created 97 new collections, 
totalling over 36 terabytes of data. Each participating library 
produced short narrative summaries at the end of each year 
describing their collecting activities. Reports from Year 1 and Year 
2 are available as Appendix 4. 

Some trends in subject areas of collections and examples include: 
Local News Media 

• Local News, collected by Birmingham Public Library 

• Spanish Newspapers, collected by New Brunswick Free 
Public Library 

• Local News , collected by Brooklyn Public Library 


Local Events 

• Shooting of Alton Sterling, collected by East Baton Rouge 
Public Library 

• July 2018 Rain Event, collected by Marshall Lyon County 
Library 

• Remembering 1 October, collected by the Henderson 
District Public Libraries 

• District of Columbia Elections 2018, collected by DC Public 
Library 

Local Government 

• Government Affairs , collected by Queens Public Library 

• Cuyahoga County Government, collected by Cleveland 
Public Library 

• City of Grand Rapids Committee Agendas and Minutes, 

collected by the Grand Rapids Public Library 

Local Arts and Culture 

• San Diego Visual Artists, collected by San Diego Public 
Library 

• Arts, Music and Authors , collected by Forbes Library 

• Comics Publishers, Artists, and Authors of Washington, 

D.C., collected by DC Public Library 

Local Environmental Issues 

• Environmental , collected by Birmingham Public Library 

• PEPCON disaster, collected by Henderson District Public 
Libraries 

Social Media 

• Social Media, collected by Kansas City Public Library 

• Westborough Social Media collected by Westborough 
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Public Library 

The grant funded, via cost-share, ongoing access to the Archive-lt 
web archiving service for additional years beyond the original two- 
year grant timeline. The Internet Archive expects to continue to 
support program participants in their collecting efforts well into 
the future as well as explore ways to expand access to the 
technical services of the program to additional public libraries. 


Years 1 & 2: Sharing of case 
studies via participation in 
professional conferences and 
blog posts 

Over the past two years cohort members have actively 
participated in professional conferences and presented at many of 
the leading library and archives meetings. Members presented on 
the Community Webs project at the following conferences: 

• Association for Tribal Archives, Libraries, and Museums 

• Endangered Data Week 

• Personal Digital Archiving 

• Society of California Archivists 

• Society of American Archivists 2018 and 2019 

• Alabama Library Association 

• Convention Seguimos Creando Enlaces 

• Texas Conference on Digital Libraries 

• Library Revolution: Urban Librarians Conference 

• New Jersey Library Association (NJLA) Annual Conference 

• Digital Public Library Association Fest 2019 

• American Libraries Association Midwinter 

• Queer History South Conference 

• New England Library Association 

• Midwest Archives Conference 

• International Internet Preservation Consortium 

• Libraries in Technology 

• Museum Association of New York 

• Joint Conference of Librarians of Color 

• New York Public Library's Research Libraries Conference 

Information about the Community Webs program was initially 
shared via several articles published on the Internet Archive blogs. 
Over the course of the project, additional blog posts were written 
for the Archive-lt blog as well as published by participating public 
libraries on their own blogs. For example, in Year 1 East Baton 

Rouge Parish Library wrote a post describing their experience 
creating a web archiving program. In Year 2 Anthony Vaver from 
Westborough Public Library and Dylan Gaffney from Forbes 

Library both wrote blog posts sharing their respective experiences. 

A list of links to blogs, news articles, and conference presentations 
about the project is available on the project website: 
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https://communitvwebs.archive-it.org/news.html. 


Years 1 & 2: Patron engagement 
and local partnership activities 

Many of the Community Webs cohort members have sought to 
engage their patrons in collection development and have 
cultivated local partnerships to support their web archiving 
activities. Some examples: 

• In 2017 and 2018 East Baton Rouge Public Library (EBRPL) 
worked one-on-one with content creators, including a 
local poet, a local potter, a digital media group that 
produces local podcasts, and a local LGBTQAI magazine to 
inform collecting decisions and determine how frequently 
content needs to be captured. EBRPL also solicits patron 
nomination of websites to be archived, and offers classes 
to patrons on web archiving for personal digital archives 
and using the Archive-it database for patron research. In 
2019 a representative from Special Collections attended 
the quarterly Teen Department meeting to share 
information about the EBRPL web archive, which Teen 
Department staff will then share to students during 
classroom talks. 

• Forbes Library reached out to 50 community partners 
(non-profits, fellow library professionals, web designers 
and other community organizers and leaders) for ideas on 
selecting websites to preserve. They used a mix of one-on- 
one meetings and a website nomination form to solicit 
input and recommendations from community partners. 
Forbes Library is also developing a "landing page" for its 
web archive on the library website to help patrons 
discover content within the web archive collections. 

• Birmingham Public Library parlayed an existing 
relationship with The invisible Histories Project into a 
resource for identifying important Alabama based LGBTQ 
resources on the web. With the help of Invisible Histories 
Project staff they were able to locate obscure websites 
covering the LGBTQ in Alabama and were also able to 
identify contacts for sites that had already been taken 
offline. They plan to connect with these contacts to see if 
any copies of the missing sites were preserved. 

• Lawrence Public Library enlisted the help of an intern who 
has been reaching out to community groups via social 
media to solicit URL nominations and to build awareness 
of the project. 
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• Henderson Libraries collaborated with their local city 
government to identify web resources from businesses, 
organizations, and community groups. They also recently 
added a seed URL nomination tool to their website which 
they plan to advertise through local news outlets. 

Overall, cohort members explored a variety of patron engagement 
and local partnership activities from crowdsourcing website 
nominations for archiving, to collection related promotional 
campaigns, to hosting panels and special speaker events related to 
their digital local history activities. The breadth of approaches to 
outreach and engagement were a successful aspect of the 
program's goals of facilitating professional development and 
learning across a variety of activity areas within the framework of 
archiving local history via collecting web-published documentary 
evidence. 


Year 2: OER course is made 
public 

A 2-hour, self paced course titled Web Archiving for Public 

Libraries has been published as open educational resources by 
WebJunction and has been promoted as a resource for the library 
and archives community. 

https://learn. web ju notion.org/course/search.php?search=web-i-ar 


chiving 

WebJunction incorporated video segments recorded during the 5 
online meetings, curriculum materials from the Community Webs 
course modules, and additional videos from cohort members 
developed specifically for this course into easy to follow lessons on 
the following topics: 

• Web Archiving 101 

• Collection Development and Community Engagement 

• Exploring Web Archiving Tools 

• Internet Archive's Archive-lt Service 

• Plan for Sustainability 

Year 2: End of program 
feedback survey 

A feedback survey was conducted at the end of the program with 
the following questions: 

• Please rate your proficiency with web archiving prior to 
participating in the Community Webs program. 

• Please rate your proficiency with web archiving after 
having participated in the Community Webs program. 

• Do you expect to continue web archiving at your 
institution after the Community Webs program ends? 

• What roadblocks, if any, do you foresee for the web 
archiving program at your institution? 
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• How helpful did you find the resources in the Community 
Webs online course? 

• Are there additional resources or areas of focus that could 
have improved the curriculum? 

• Please rate your overall satisfaction with the Community 
Webs program. 

• Please provide brief feedback about your experience 
participating in the Community Webs program. 

Responses were positive overall. When asked to rate their 
satisfaction with the program on a scale of 1-5 (one being not 
satisfied and 5 being very satisfied), 56% rated it a 4 and 44% 
rated it a 5. Complete results of this survey, including written 
responses, are included as Appendix 7. 


Years 1 & 2: The presence of 
public libraries in the web 
archiving community increases. 

Prior to the Community Webs program, public libraries accounted 
for less than 3% of respondents to the 2011, 2013, and 2016 NDSA 
Web Archiving in the United States surveys of institutions 
archiving web-published materials. In 2017 this number increased 
to over 12%. 

Since the Community Webs program started, 35 public libraries 
have contacted Archive-lt, 7 have started trials with the Archive-lt 
service, and 3 have begun using Archive-lt in their institutions. 

Public libraries now make up 7% of all Archive-lt partners (Archive- 
lt is by far the mostly widely used web archiving service per many 
surveys, so provides an authoritative data point in measuring 
institutional representations). This is up from 1.4% prior to the 
Community Webs program. 

The Community Webs program also lead to some participants 
achieving individual professional accomplishments. For instance: 

• A public librarian, Emily Ward of the East Baton Rouge 

Parish Library, was elected as the Vice Chair of the Web 
Archiving Section of SAA in 2018-2019 and will serve as 

Chair in 2019-2020. This is the first time a public librarian 
will be in a leadership position of this group. 

• Makiba Foster, Assistant Chief Librarian at the Schomburg 
Center for Research in Black Culture at NYPL, started the 
#SchomburgSyllabus project as one of her collections 
created via the Community Webs program. This effort lead 
to the Schomberg receiving an $225,000 grant from the 
Mellon Foundation to hire a Digital Archivists to expand 
this work. 

• Some of the collections of DCPL were featured at the 
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Archives Unleashed Hackathon held in Washington D.C. in 
March 2019, proving the value of these collections not just 
for historical preservation, but also in support of data- 
driven computational research. More information on this 
even and its use of Community Webs collections can be 
seen at https://archivesunleashed.org/washington/. 

Laying the groundwork and seeding the capacity for these sorts of 
individual and institutional advancements was an explicit goal of 
the Community Webs program. 


Year 2: Community Webs White 
Paper 

Four cohort members and Internet Archive staff are actively 
collaborating on a white paper that will take a deeper look at web 
archiving from a public library perspective. The paper will cover 
the experience of participants in the program and offer guidance 
on the various challenges and solutions developed locally by 
cohort members as part of their implementation of program 
activities. Once this paper is expected to be submitted public 
library journals in 2019 and will be circulated via the Archive-lt 
website and social media. 



18. Changes 


Type of Change 

Description 

Date of Approval (if applicable) 

There were no changes to the IMLS 
funding or its areas of expenditures. 

However, there was an increase in 
the number of libraries/librarians 
participating in the overall program 
that was funded via internal Internet 
Archive funding and additional 
subsidies. 

The original IMLS funding supported a 
specific number of librarians to participate 
in the program. Given the large number of 
applicants for the few spots in the 
program, additional internal funding was 
secured to support an additional 12 
libraries joining the cohort. Outside 
funding from the Kahle-Austin Foundation 
and from Archive-lt covered this program 
expansion. 

N/A 


19. Results 

a. Agency-Level Goals 

Promote Lifelong Learning 
Build Capacity 
X Increase Public Access 
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Performance 

Measure 

Statement 

Survey 

Respondent 

No. of 

Participants 

No. Total 

Responses 

No. Responses Per Answer Option 

No. Non Responses 

Strongly 

Disagree 

Disagree 

Neither 

Agree nor 
Disagree 

Agree 

Strongly Agree 












For Increase Access projects, proceed directly to Question 19b. 

b. Program and Project-Level Results (for All Projects) 


Intended Result(s) 

Actual Result(s) 

Explanation of Any Variance 

Develop a cohort of 15 public libraries 
with experience building community 
focused web archive collections 

27 public libraries are actively 
building community focused web 
archive collections as part of this 
program. A number of additional 
public libraries were added to the 
cohort (but did not receive the 
grant-funding professional 
development or local programming 
stipends) over the course of the 
project. 

Additional funding from the Kahle- 
Austin foundation increased slots in 
the cohort by 12. 

Over 30 terabytes of web-published 
community history materials will be 
archived in perpetuity 

As of year 2, over 36 terabytes of 
web-published community history 
materials have been archived. 

Additional cohort members account 

for an increase in data archived. 

Thirty presentations about Community 
Webs will be given at national and 
regional conferences. 

Cohort members attended and 
presented at 21 different 
conferences over the 2 year 
project. Many of these conferences 
featured entire conference 
sessions focusing on the 

Community Webs program, 
including at ALA, SAA, DPLAfest, 

IIPC, and others. These panels 
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featured 3-5 presentations each 
from Community Webs 
participants and staff. Thus the 
number of individual presentations 
by Community Webs participants 
was well over 50 presentations at 
local, regional, national, and 
international conferences. 


A suite of open access training materials 
on the topic of digital stewardship and 
local history within public libraries will be 
made available online. 

An OER course has been made 
public via WebJunction and 
curriculum materials as well as 

videos from webinars have been 
made publicly available at 
https://communitywebs.archive- 


it.org/curriculum.html. 


20. Lessons Learned 
Year One: 

Professional development and conference attendance has proven fundamental to the success of the program. The 
public library field can be an isolated, under-resourced community that has few collaborative opportunities for 
professional development and engagement. In providing both opportunities for training and in-person collaboration, the 
cohort model has allowed us to seed a community of practitioners who have been actively spreading their experiences 
with the larger public library community via conference presentations, blog posts and participation in professional 
groups. The effect of this is already noticeable in the increasing numbers of public libraries getting in touch with the 
Internet Archive expressing interest in developing their own web archiving programs. Since commencing the program, 
additional public libraries have reached out to lA directly, and four public libraries have established new web archiving 
initiatives outside of the Community Webs program, a notable increase fueled by the success of the program. 

As described in this report, several innovative collecting ideas and policies have been developed as part of the program. 
These experiences have then inspired other cohort members to model similar collections and policies in their 
institutions. Like in other IMLS-funded initiatives, such as OSSArcFlow, open and transparent sharing of planning, policy, 
and workflow documents across a professional community of shared interests can greatly facilitate local capacity 
building and program growth. What may originally seem very localized or institution-specific collecting activities, can in 
fact have significant impact across a cohort of peers, and give tangible evidence and meaning to otherwise abstract 
principles such as collection development. Having seen the positive ramifications of this knowledge sharing, the program 
team has sought to feature additional curricular content, as well as additional speakers and presenters, to the training 
materials in order to provide even more examples of innovative collection-building projects. Similar innovation has 
occurred in the development of local programming ideas, showing that public programing ideas can also help promote 
collecting. 

The large number of applications to join the Community Webs cohort demonstrates the substantial interest in 
implementing web archiving programs within public libraries. However, the technical, cost, and staff burdens of having 
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a web archiving program has continued to limit the numbers of non-subsidized institutions starting web archiving. lA 
continues to subsidize new programs, including those subscribing to Archive-lt outside the Community Webs program. 
Interest in, and the success of, the program indicates the criticality of preserving community-based, web-published 
historical materials for public libraries. 

Web Archiving remains an activity requiring up-front investment in onboarding and technical training. The pace at which 
cohort members established their web archive programs varied, but in general we found a continued need to provide 
users direct assistance in the process. It is possible that no amount of videos, documentation, or other online, 
asynchronous resources will suffice for some users to get up and running. The specialized trainings and individualized 
assistance provided via Community Webs was essential in helping users feel confident archiving the web and effectively 
use the available technologies to achieve their collecting goals. 

Year Two: 

The range of technical expertise in the Community Webs cohort was broader than we tend to see in users who seek out 
Archive-lt as a web archiving solution. In some cases cohort members had a difficult time keeping up with written and 
recorded training materials and online group trainings. At least one member expressed that they would have preferred 
being able to sit down with someone from Archive-lt in person to walk through processes together. We were made 
aware of this early and took steps to try to allow for more face time between members and Archive-lt staff, including 
holding "office hours" during in-person meetings, a virtual "office hours" session, and scheduling a one-on-one virtual 
meeting with one cohort member who needed additional help. Moving forward we may look into options for service 
levels that offer more direct support to help accommodate users of different technical abilities or that offload some of 
the collection management and crawling/harvesting or quality assurance activities that often were barriers to time- 
strapped participants. 

Most Archive-lt users come to the service with an idea of what they are planning to archive. Community Webs members 
began planning their collections after their subscription year started, which meant that many didn't actually start 
archiving content until well into the first year of the project. Additionally, all libraries were given the same "data 
budget" (512 GB) regardless of their institution's size or geographic location. This "one account size fits all" approach 
made sense in program creation and for grant funding and metrics, but it also evoked anxiety in some participants who 
felt their collecting goals could not meet the data volumes expected in the grant. The data size in websites can also vary 
dramatically, making planning new collections against a specific data budget challenging. Some of the cohort members 
from smaller communities had trouble identifying and capturing 0.5 TB of local history content to archive each year. The 
Community Webs program will need to explore more flexible and iterative ways of approaching aligning collecting 
interests with subsidized data archiving volumes. 

A number of cohort members expressed reticence to actually start and save crawls in Archive-lt because they were 
concerned about things like archiving the right websites, capture quality, or capturing unnecessary or unwanted 
content. Archive-lt staff could have done more in the beginning to help members focus on the big picture without 
getting stuck in some of the minute details. Additionally, more resources can be developed to help guide new users 
through the process of envisioning potential collections, setting benchmarks and realistic expectations around 
workflows and time management, and addressing concerns around quality, completeness and perfection. 

One of the drawbacks to having a dispersed cohort was that their only opportunities to interact in person was once a 
year at our annual meetings or occasionally at other conferences. Outside of those meetings most interaction was 
limited to discussions in course forums, during the virtual trainings and lectures, and on occasional group check-in calls. 

We created a Community Webs Slack instance in an effort to encourage more conversation in an easy to access, chat- 
friendly, informal platform, but its use was limited. In the wrap up survey members identified the cohort community as 
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one of the most significant benefits of the program, so we plan on promoting the Slack channel and adding other 
informal, asynchronous tools as a way to stay connected and continue conversation. 

While the Community Webs program administrators ensured that iterative development and improvement was a part of 
the two-year grant, there remained a need for holistic, external evaluation of achievements and outcomes. Participants 
were very engaged throughout the program and diligent in delivering summary reports in writing and at the in-person 
meetings and in presenting to the community, however the day-to-day achievements of single libraries or librarians 
were often hard to track and aggregate by administrators. The program likely had even greater local and personal 
impact than its specific reported grant-based outcomes and deliverables. Additional evaluative efforts and mechanisms 
in this area would also benefit the program's continuation and growth. Lastly, the program's specific grant structure left 
little time for evaluation of the opportunities and challenges of expansion and little time for program staff to pursue 
strategic partnerships and collaborations. This work is happening outside the specific grant activities, and continuing 
after the grant's end date, but having some of these activities - evaluative and strategic - better integrated into the 
operations of the program would be beneficial. 

21. Next Steps 

The Internet Archive always intended to continue the Community Webs program beyond the specific two-year project 
funded by IMLS, as evidenced both in the post-grant timeline cost-sharing and subsidized technical services and in 
having secured other funding to double the size of the project after the overwhelming number of applicants. Program 
administrators expect "next step" activities to take place in these areas: 

• Ongoing Cohort Collection Development 

o The IMLS and additional funding both support the work of the full cohort of Community Webs 
participating institutions to continue their collection development efforts for 3 more years free of 
charge. This includes a significant volume of data storage and full access to public hosting, preservation 
data for local storage (if desired), unlimited technical support, and the ability to leverage Internet 
Archive and Archive-lt staff and resources and networks in support of publishing, blogging, conference 
presentation, and other outreach and professional service opportunities. This should facilitate adding 
dozens of terabytes to the existing archive of public library curated local history collections. 

• Ongoing Cohort Professional Development 

o As mentioned in this report, both cohort members and program staff are working together on multiple 
publications for professional journals documenting various program outcomes. As well, all forums and 
other communications platforms will continue to be supported by the Internet Archive beyond the 
grant. Program staff will also make themselves available to organize or support conference submissions 
and other marketing efforts of participants. We expect program outreach and communication efforts to 
continue well into the future. 

• Ongoing Cohort Professional Networking 

o Program staff will continue to host informal meetings of participants at various conferences, including 
ad-hoc meetups, dinners, and in-person Q&A and informal training sessions. These activities took place 
at the recent 2019 SAA Conference. 

• Ongoing Program Participant Expansion 

o Many public libraries have reached out to Archive-lt about participation in the program and we have 
allowed many of them to join, including via free and subsidized access to the Archive-lt service. We 
expect to continue these efforts to build the community of public libraries building local history web 
collections. 
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• Ongoing Program Participant Diversification 

o The staff at Internet Archive that developed the Community Webs program is actively exploring ways to 
expand the model to other types of institutions that also have the mandate to preserve web-based 
resources documenting local history. The recent IMLS-funded National Forum grant, Advancing Art 
Libraries and Curated Web Archives, https://archive-it.org/blog/learn-more/art-libraries/, includes a 
focus on how art and museum libraries could pursue similar work through a cohort-based, multi- 
institutional effort at community and capacity building. One can envision the same framework 
underpinning localized preservation efforts via historical societies, community college libraries/archives, 
research libraries, HBCUs, and other affiliated institution types. There are a wealth of institutions 
focused on preserving community records and the model of the Community Web program was designed 
to be extensible and scalable to many institutions and areas of born-digital archiving. Program creators 
will continue efforts to diversify the Community Webs cohort, both its participating librarians and its 
type of organizations. 

• Ongoing Program Evaluation 

o As noted in the Lessons Learned section, efforts are currently underway for higher-level evaluation of 
the program's outcomes, successes, and areas of improvement. We are working on securing funding for 
an external, professional evaluator to focus on this work. Program evaluation will focus on both 
documenting program outcomes as well as recommendations for addressing lessons learned and ways 
to implement corrective adjustments to existing program materials. We expect this evaluation to 
include: 

• External Program Analysis & Measurements 

• Cohort & Program Staff Interviews 

• Evaluation Meetings with Key Program Participants 

• Partnership & Strategic Development Research 

• Recommendations for Community Expansion & Sustainability 

• Ongoing Program Strategic Planning 

o Program administrators have already initiated conversations with the likes of DPLA, Library of Congress, 
and multiple state libraries around ways that collaborating partners can help sustain and expand the 
program. These partnerships can take many potential forms, from data replication to collection 
aggregation to hosting summits and convenings to participating in governance and consortial efforts 
towards a national expansion. Indeed multiple partnerships can help the Community Webs program 
scale and morph in many different ways. As such, program staff intend to pursue additional funding for 
scenario planning, partner recruitment, business development modeling, and other strategic endeavors 
to vastly expand the number of participating librarians, the amount of training and OER materials made 
freely available for professional development, and the scope and volume of born-digital community 
history collections permanently preserved and openly accessible. 

22. Appendices 

The following documents and reports are included to provide additional insight into program and curriculum materials 

developed as part of the program and the current state of participant activities. 

Appendix 1: Press Release. The Call for Applications press release shared on dozens of listservs and channels. 

Appendix 2: Online Application. A copy of the online application for public libraries to participate in Community Webs. 
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Appendix 3: Kickoff Meeting Agenda. The agenda for the Y1 program kickoff meeting in San Francisco, CA 

Appendix 4: Library Summary Updates. Brief progress reports detailing Year 1 & Year 2 activities from all participating 
Community Webs public libraries, both the 15 IMLS-supported libraries and 12 Internet Archive funded libraries. 

Appendix 5: Year 2 Survey. Questions and responses to help inform the year 2 meeting agenda and training modules. 

Appendix 6: Year 2 Meeting Agenda. The agenda for the Y2 program meeting in Columbus, OH. 

Appendix 7: Project Feedback Survey. Final program evaluation wrap-up questions and responses from participants. 
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Appendix 1: Press Release 



Empowering Public Libraries 
to Create Community History 
Web Archives 

CALL FOR APPLICATIONS 

DEADLINE: AUGUST 25, 2017 

Local history collections have long served as vital resources for dynamic communities, but how 
are they adapting to the predominance of web-based publishing? The ability to preserve local 
records from online news, local blogs, regional websites, social media, and other platforms is an 
increasingly important skill for librarians seeking to fulfill their role as information custodians 
and community anchors in the age of the web. 

Web content is inherently ephemeral, with an average lifespan of only ninety days. By including 
web archives in their collecting activities, librarians can play an active role in preserving the 
contemporary records of their communities and ensure that these unique materials are 
accessible for use as a trusted source in the future. 

Interested in learning more? The Internet Archive is accepting applications from public librarians 
for a funded program of continuing education, training, and services in the field of web 
archiving. 


COMMUNITY 
IWEBS 



SCOPE 

• A two year IMLS funded program to provide education, applied training, cohort support, 
and web archiving services for public librarians to develop expertise in web archiving. 

• The project will preserve over 35 terabytes of community heritage materials for 
long-term access 

• Create open educational resources relating to web archiving and digital preservation 

• Explore new forms of local engagement and partnerships through public programming 
support. 

RESOURCES 

• Each library will receive the equivalent of $25,000 in web archiving services, equaling a 
0.50 TB archival data budget per year for five years (extending beyond the two years of 
this project) to build local history web archive collections. This includes access to and 
training on the Archive-lt web application. All content archived in this project will be 
available in perpetuity. 

• Each librarian will also receive $3,500 per year to attend cohort meetings, participate in 
trainings, conferences and professional development, and local events. 





WHAT DOES PARTICIPATION LOOK LIKE? 


• Project activities averaging 2 hours a week in staff time 

• Six project-related virtual trainings (spread over 2 years) 

• In-person cohort meetings (1 per year; costs covered per above) 

• Attendance at a local/regional conference to speak on or promote the project (1 per 
year; costs covered per above) 

• Opportunities to contribute to documentary materials and publish on outcomes (such as 
blogging or writing case studies) 

• Participate in a cohort community via online forum, feedback, etc. 

WHO SHOULD APPLY 

Prior technical knowledge is not required or expected. This program is open to public libraries of 
all sizes and locations. Our goal is to expand the types of institutions collecting web based 
content, so any libraries interested in exploring archiving the web are encouraged to apply. 

Visit us online: https://archiveit.ora/ or to apply Community Webs Application 


PROGRAM PARTNERS 
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INSTITUTE OF MUSEUM AND 
LIBRARY SERVICES nmls.gov) 


A 


ARCHIVE-IT 

(archive-it.org) 



QUEENS PUBLIC LIBRARY 
(queenslibrary.org) 



WEBJUNCTION 


Queens Library 


(webjunction.org) 



SAN FRANCISCO PUBLIC 
LIBRARY (sfpl.org ) 


^ CLEVELAND PUBLIC 
r.,uci,.MRv library ( cpl.org) 


INQUIRIES 


Maria Praetzellis 
Program Manager, 
Archive-lt, 
Internet Archive 
maria (Q)archive.ora 


Sylvie Rollason-Cass 
Web Archivist, 
Archive-lt, 
Internet Archive 
svlvie(Q)archive.ora 













Appendix 2: Online Application Form 



Community Webs: Empowering Public 
Libraries to Create Community History Web 
Archives 


o 

Tr 


The Internet Archive is accepting applications from public librarians interested in participating in a program of continuing 
education and training to enable libraries to build collections of historically-valuable, web published materials 
documenting their local communities. This program is offered by the Internet Archive in partnership with WebJunction, 
and with funding from the Institute of Museum and Library Services to provide subsidized training, professional 
development and web archiving services. Read more about the program: https;//archive-it.org/blog/projects/community- 
webs/. Visit our FAQ page: https://archive-it.org/blog/community-webs-faqs/ 

Please note that there will be an in-person meeting November 2-3,2017 at the Internet Archive headquarters in San 
Francisco. Attendance at this meeting is a core component of the grant so we ask that all applicants be available to 
attend the meeting. Funding to attend this meeting is included in the stipend given to all participants. 

Before completing this application, please: 

1. Read the “What does participation look like* section at https://archive-it.org/blog/projects/community-webs/ and the 
FAQ page at https://archive-it.org/blog/community-webs-faqs/ to make sure you understand the requirements for 
participation. Send any questions to Program Manager Maria Praetzellis at maria(ffiarchive.org. 

2. Review all of the application questions and take time to prepare your responses. This is a competitive application 
process toward the selection of libraries from across the United States. 

3. Submit your completed application by 5:00 p.m. PDT on Friday, August 25. Be sure to press “done" at the end of the 
application to submit your results-you'll know it's worked when you are redirected to a thank you page. 

The information you submit in your application will be accessible to the Community Webs project team. Input from 
applications may be used in project reporting, but will not include any personally identifiable information. 


Email address* 

Valid email address 



This form is collecting email addresses Change settings 


Untitled Title 

Description (optional) 

Name * 

First and last name 


-Short answer text 


Institution * 

Short answer text 


Institution Mailing Address * 

Short answer text 


Phone number* 

Short answer text 



Which of the following positions best describes your role at the library? * 

Director 


Manager or Supervisor 


Librarian 


Assistant or Associate 


■jther 


How many full-time staff are employed at the building/branch where you 
work? 

Short answer text 


Institutional description * 

Please describe your library's experience with local history programming or collecting and any complementary programs 
that have been undertaken. You are not required to have experience in this area to apply. (250 word limit). 

Long answer text 


Demographics of library patrons * 

Please include the following general statistics of community served: population size, racial or ethnic distribution and 
income level. (250 word limit) 



Long answer text 


Please tell us about the community your library serves. What makes it unique and 
how might your library's participation in this project address community learning 
needs? (250 word limit) 

Long answer text 


If selected, can you confirm that your library will sign a lightweight agreement 
with the Internet Archive on expectations and parameters for participation? This 
is a common requirement for subsidized services provided via IMLS funding. 

'C- * 



No 


Unsure 


Do you have regular access to an internet-connected computer or laptop, as well 
as an audio connection (telephone or audio through your computer), to 
participate in online webinars and meetings and to access the online community 
of practice for this project? 



!. ' Yes 


No 

I Maybe 


Will you be able to attend the In-person meeting, November 2-3,2017 at ^he * 
Internet Archive headquarters in San Francisco? 

Yes 


No 

Unsure 


' ) Uther 


How did you hear about the Community Webs project? 

Short answer text 


If selected to participate in this project, you will be your library's primary point of 
contact for all project activities. This will require ongoing accountability as part 
of the Community Webs cohort and project, as well as the facilitation of input 
and decision-making within your library and community to meet project goals. 
Please describe how you (and your supervisor, if applicable) will coordinate your 
responsibilities to be successful in this project. (250 word limit.) 



Long answer text 


Please share any additional questions or thoughts regarding your potential 
participation in Community Webs. 

Long answer text 


















































Appendix 3: Kickoff Meeting Agenda 



Community Webs Meeting 

November 3, 2017 
Internet Archive Headquarters 
San Francisco, CA 


09:00-09:30 Coffee and Light Breakfast 
09:30-10:00 Introduction and remarks, Brewster Kahle 

Overview of Grant Goals & Objectives, Jefferson Bailey 

10:00-10:30 

Trends and Models in Public Library Archives, Diantha Schull 
Evolving Practices in Web Archiving, Maria Praetzellis 
10:30-11:00 Break & Networking 
11:00-12:00 Facilitated Discussion #1 
12:00-13:30 Lunch and tour of the Internet Archive 
13:30-14:45 Panel 

Promising Projects in Digital Collections and Web Archiving 

Mel Gooch, Main Library 5th Floor Manager, San Francisco Public Library 

Natalie Milbrodt, Queens Memory Program Director and Metadata Services Coordinator, Queens 

Library 

Chatham Ewing, Digital Library Strategist, Cleveland Public Library 
Abbie Zeltzer, Librarian, Patagonia Library 
14:45-15:15 Break & Networking 
15:15-16:15 Facilitated Discussion #2 

16:15-17:00 Review of the Day, Open Group Discussion, Next Steps & Activities 
17:00-18:00 Reception at the Internet Archive 



Appendix 4: Library Summary Updates 



Y1 updates from IMLS Libraries 


Birmingham Public Library 1 

Cleveland Public Library 2 

East Baton Rouge Public Library 2 

Forbes Library 3 

Grand Rapids Public Library 4 

LA County Public Library 6 

Marshall Lyon County Library 7 

New Brunswick Free Public Library 8 

Patagonia Library 9 

Queens Library 9 

San Diego Public Library 10 

Schomburg Center for Research in Black Culture, The New York Public Library 11 

San Francisco Public Library 12 

The Urbana Free Library 13 

West Hartford Public Library 14 


Birmingham Public Library 

The Birmingham Pubic Library (BPL) has spent the first year of the IMLS Community Webs project 
determining the library’s subject areas in which to collect, researching and deciding on the best metadata 
to use for web archiving, working with tutorials and test crawls to become proficient in using the 
Archive-It software, and communicating with our community about the project. 

The subject areas BPL will be collecting are: Jefferson County schools and alumni sites; non-profits; local 
elections; Birmingham government; social and cultural; local news; environmental; and LGBTQIA. We 
have been working with our Archives Department who is in contact with citizens who are involved in 
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many of these organizations. The two main groups we have worked with this year have been the 
environmental and LGBTQIA organizations. 

BPL created a working guidelines document titled Birmingham Public Library archiving metadata 
application profile.This profde predominantly follows definitions provided by Descriptive metadata for 
web archiving: recommendations of the OCLC Research Library Partnership Web Archiving Metadata 
Working Group (OCLC data dictionary). 

Working with Archive-It has been challenging due to the nature of the web and determining how best to 
set up the rules to crawl in order to get only the information needed to preserve. Several collections have 
been made public and some are scheduled to be made public. Public collections so far include 
Birmingham AL area K-12 schools, the 2017 Birmingham mayoral race, the BPL collection which 
includes the website and flickr account, LGBTQIA in Alabama, and local news. 

At the Alabama Library Association Convention in April, BPL presented a poster during the poster 
session and spoke with librarians representing colleges, public libraries, and private institutions. A 
librarian with the University of North Alabama discussed using the Community Webs project as an 
example in her classroom and the Fairhope Public Library Foundation was excited to learn that there is a 
way to preserve community websites and plans to reach out to their community to preserve their local 
sites. 

It has been an interesting learning experience working with the Internet Archive and the Community 
Webs group of librarians. Thank you for including BPL in this most worthwhile project. 


Cleveland Public Library 

Narrative 

CPL has used funding from Community Webs to sponsor a local educational program related to lA and 
Web Archiving for a local digitization interest group. We have also used the funds for travel to two 
conferences for the local grant lead here a CPL. 


East Baton Rouge Public Library 

Overview 

The following provides details on activities conducted in the first year of the Community Webs project 
and plans for the future. 

19 Collections 

245.9 GB archived this year 

Collection Development Policy 

Special Collections (SC) created a development policy after realizing the need to set collecting parameters 
and guidelines for both staff and the community it wishes to engage in collecting activities. You can see 
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the most recent version of that policy here . Models for the EBRPL Web Archives Collection 
Development Policy included the Bentley Historical Library Web Archives Collection Development 
Policy and the Collection Development Policy for the East Baton Rouge Parish Library Archives. 

One-on-One Outreach 

Special Collections has made efforts to work with certain content creators on a one-on-one basis in order 
to preserve content that SC deems integral to the historical record of the city. SC has worked with a local 
poet, a local potter, a digital media group that produces and disseminates local podcasts, and a local 
LGBTQAI magazine that recently went to publication. SC spent time getting to know these creators in 
order to make collecting decisions and to learn how frequently sites should be crawled due to updates and 
deletion. SC provides creators with a link to their content from Archive-it that should be stored in with 
their “personal archives.” 

Seed URL Nomination 

SC has created a Google Form that is accessible through the Baton Rouge Room Infosuide. and allows 
patrons to nominate URLs to be crawled. SC plans to create targeted forms to send to 
organizations/individuals for easy URL submission whenever necessary. 

Presentations 

A representative from EBRPL has presented on the topic of creating a community-centric web archive for 
EBRPL at Personal Digital Archiving 2018 in Houston Texas on April 25, 2018, at Texas Conference for 
Digital Libraries on May 17, 2018 in Austin, Texas, and will present at the Louisiana Manuscripts and 
Archives Association Annual Conference in Ruston, Louisiana on October 25, 2018. 

Instruction 

SC has taught two staff classes that included sections on how to use Archive-it for patron research. There 
are plans to teach this course again at the end of the year. Classes taught to patrons also include sections 
on web archiving for personal digital archives and using the Archive-it database for patron research. 


Forbes Library 

Forbes Library has enthusiastically and deliberately approached the creation of a community focused web 
archive. The library was undergoing a Strategic Planning process at the time of the Community Webs 
grant application and built specific goals related to the training of staff in web archiving, identification of 
community partners for help in curating the collection, and building the collection itself into the Strategic 
Plan. We have begun training two additional staff members in Archive-It software, and reached out to 50 
community partners (non-profits, fellow library professionals, web designers and other community 
organizers and leaders) for ideas on websites to preserve. Beginning in February, we used a mix of 
one-on-one meetings and customized emails with a Website Capture google form attached to solicit input 
and have received some great recommendations through the process. Many of the responses fit ideas and 
themes our own staff had identified as topics or sites for preservation, but we were particularly excited for 
sites or themes we hadn't identified in our own internal process. This month, after a long process of test 
crawls we are ramping up our preservation of data from all sites tested, releasing a press release to local 
media and linking the press release and website capture form from our library's front page. We have 
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assigned a staff member to develop a "landing page" for the web archive on our website and hope to try to 
use some creative means to help our patrons discover content within the collection once it goes public. 

Other great news is that I will be presenting a lightning talk at the Public Libraries section of SAA on our 
Community webs curation experience. Tha nk you for your encouragement. I am hoping we will have 
some great feedback from the community in the next few weeks to give me more info to share in those 
few moments. It was great to chat briefly with Karl at NBA in New Haven. 


Grand Rapids Public Library 

Overview 

The Community Webs grant enabled us to start web archiving. Prior to this project, web archiving seemed 
out of reach for us as an institution. The Community Webs grant removed the initial cost barrier and made 
it possible for us to start a long-term project. 

The open nature of the Community Webs program is also very helpful. The work of building a sustainable 
web archiving program does not always follow concrete deadlines, and the flexible nature of the project 
has allowed us to be thoughtful and intentional. Our program 5-10 years down the road will be more 
successful thanks to the time we are taking in getting started now. 

Training in Archive-It and web crawling 

Librarian/manager Julie Tabberer and library assistant Jennifer Andrew are both working with Archive- It 
and completing training videos. We are also discussing using library assistants from other departments to 
aid in adding seeds and metadata and doing quality analysis. 

Press release 

In November 2017, our Communications department sent out a press release announcing the Community 
Webs grant. We received interest from several local reporters and three stories were published: 

- GRPL awarded grant to archive online historical content - WGVU Public Media 

- GRPL will preserve web history with $25K grant - Grand Rapids Press/Mlive Media 

- GRPL wins grant to ‘preserve web materials’ - Grand Rapids Business Journal 

Creating collections 

We started crawling websites almost as soon as our subscription became active. The November cohort 
meeting was helpful in providing a place to talk with other librarians and archivists who are starting out 
(like us) or have been doing web archiving a little while. The advice that stuck with me the most was to 
be careful and thoughtful. A librarian from San Francisco Public Library shared that their web archiving 
program had been done without clear planning and the result was unwieldy, frustrating and difficult to 
use. 

It is tempting to crawl dozens and dozens of websites immediately (the list of sites I want to crawl is 
growing all the time!). Rather than focusing on the amount of data or the number of sites, I am being 
intentional about what we crawl and making sure that we are doing it in a way that makes sense and is 
sustainable. 
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Collections that we have started include: 

• Grand Rapids Public Library - web page, social media, news articles for the library 

• Grand Rapids Public Library - Internal - the intranet of our library, which includes memos, news and 
staff notes. This collection is not meant to be published. 

• News and media - regularly scheduled crawls for two hyperlocal news sources. The Rapidian and Rapid 
Growth 

• West Mich Music Hysterical Society - a grassroots website documenting local music history 

• ArtPrize - international art event that takes place annually in Grand Rapids. The website includes 
extensive information about artists and their works of art, including pictures. 

• Grand Rapids Funeral Homes and Obituaries - a collection of sites publishing local obituaries, many of 
which are no longer printed in the newspaper. Many of these crawls did not work well and need additional 
attention. 

Publishing collections 

We are currently working on adding metadata and descriptions to collections, doing quality analysis and 
then making them public. This portion of the project has led to some larger questions about the best way 
to approach web archiving from a user’s perspective. I am currently researching how web archives are 
used by researchers, and rethinking our approach to securing intellectual rights. Briefly, as an archives we 
should clear intellectual rights to as much of our material as we can, making it possible for researchers to 
use it in their work. That has been part of our function with physical material and our responsibility is no 
different with digital material. A smaller set of material with clear rights statements will be more useful to 
scholars than capturing a lot of content with no rights. 

Community connections 

Our participation in the Community Webs project has been helpful in strengthening connections with 
archival donors. We were approached by three organizations who wanted to ensure that their websites 
would be maintained. Two organizations closed recently and were concerned about what that meant for 
their content online. Another site (West Mich Musical Hysterical Society) has been a labor of love by two 
individuals, and they want to make sure that their work outlasts them. Having the ability to capture these 
websites has been invaluable, and aids in keeping our archive relevant to our community. 

Next steps 

Our priorities for the coming year are: 

• Publish the majority of our collections 

• Create links from our library website and catalog to the Archive-It page 

• Crawl sites for organizations whose physical records we hold 

• Focus on presenting at conferences and hosting programs locally 
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LA County Public Library 

During the first year of participation in the Community Webs cohort, LA County Library has made strides 
to educate internal stakeholders about the purposes and processes of web archiving, begun drafting a 
policy to guide web archiving at the Library, shared lessons learned with attendees of the Society of 
California Archivists' Annual General Meeting, and started developing a web archive that will serve as 
the foundation for a community-shaped web archive resource. 

GETTING STARTED WITH WEB ARCHIVING 

Upon reception of the grant, the Digital Projects Coordinator began working through the educational 
materials presented in the online course website and reading supplemental materials to get up to speed 
with web archiving. She began working with the Archive-It software to familiarize herself with its 
functions and took advantage of the reference materials available on the Archive-It site and the ability to 
contact Archive-It staff for support via the online ticketing system. She attended the first meeting of the 
cohort in San Francisco in October 2017. 

CONVERSATIONS WITH INTERNAE STAKEHOEDERS 

Over the course of the year, the Digital Projects Coordinator has had conversations with various groups 
inside EA County Eibrary to educate them about what web archiving is, make arguments for its relevance 
to the mission and strategic goals of the Eibrary, and seek input on how the Eibrary should build a web 
archive with heavy input from our communities. She wrote a justification memo for the Eibrary's 
Executive Team, led conversations about web archiving at meetings of groups such as the Focal History 
Working Group and the Digitization Steering Committee, and had casual conversations with lots of staff. 

DRAFTING A WEB ARCHIVING POEICY 

Based on her review of the literature, work with the cohort and the grant's educational materials, and 
conversations with stakeholders inside EA County Eibrary, the Digital Projects Coordinator completed a 
draft of a policy that will guide the Eibrary's web archiving activity. The draft policy addresses rationales 
for site selection, web archiving settings (including practices regarding site owners and copyright), and 
preservation and access standards for archived sites. A crucial part of the policy in development is 
building in mechanisms to seek out community input on the contents of the archive in an overtly active 
way that includes education. Given the goals of the grant, the Eibrary hopes to do more than invite 
suggestions for sites or pages to include. The draft policy "especially focuses on education, outreach, and 
facilitating input from historically underrepresented groups and populations, as the online information 
resources created and used by these groups is likely to be especially vulnerable." 

PANEE PRESENTATION AT SOCIETY OF CAEIFORNIA ARCHIVISTS' ANNUAE GENERAE 
MEETING 

The Digital Projects Coordinator attended and participated in a panel with several other cohort members 
at the Society of California Archivists' Annual General Meeting in April 2018, titled Community Webs: 
Empowering Public Eibrarians to Create Community History Web Archives. Her presentation focused on 
lessons from her conversations with internal stakeholders and making arguments for web archiving’s 
utility and meaning for public libraries. For a description of the session, see here . 

BUIEDING A FOUNDATIONAE WEB ARCHIVE 
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The Library is in the midst of creating a foundational web archive that the community will be able to 
expand and shape in the next phase of the grant. It created a list of "candidate sites" identified 
systematically based on their potential to be useful long-term to various groups of users. The groups of 
sites identified were: 

• Sites relevant to any resident of LA County (e.g., sites administered by the county government) 

• Sites relevant to residents of smaller geographic areas within the county (e.g., sites linked from 
webpages of individual LA County Library locations; see the :local links” section under the 
community tab on this page l 

• Sites relevant to users of special collections, especially historical collections, within the Library 
(e.g., sites linked from webpages of LA County Library’s four Ethnic Resource Centers; see the 
“websites” tab on this pagel 

It then refined the initial list of sites by criteria such as site audience, site scope and contents, and 
potential vulnerability. The Library is in the midst of testing crawls of these sites and building the archive 
that will serve as the foundation for the public outreach phase of the grant. 


Marshall Lyon County Library 

Technical challenges have been two-fold: determining frequency and proper scope of different pages and 
working with or around robots.txt exclusions. Although many websites have crawled effectively with no 
special scoping or with built-in scoping, we discovered a few platform-related issues after the fact with 
some of our pages. Determining what platform a website has been built on and what scoping rules might 
apply has been a challenge. We’ve also had issues with site calendars, but the upcoming option of 
Brozzler should help that considerably. 

Likewise, we’ve run into a few platform-specific built-in robots.txt exclusions that we need to either work 
around or ask to be allowed within. Our local county’s website has been particularly challenging. They 
have had difficulties with malicious hots since the 2016 election cycle, and many of the directories that 
allow their website to display properly are blocked to all hots. We are still in communication with them to 
have the Archive.org hot allowed, as we’d prefer not to ignore their exclusions if possible. 

Our collections have been broken down into two areas: the Marshall Community, including events, 
government, and community organizations, and Southwestern MN Libraries, which focuses primarily on 
the libraries of the Plum Creek Library System regional library system. 

For both collections, the relatively small digital footprint of our area has affected our collection 
development. Many libraries have only a very small website on the regional library’s website, offering 
hours, contact options, and other basic information. Many make use of Facebook more than their official 
website, likely due to the relative ease of use of a social media platform rather than a static website. Some 
post frequently to Facebook, but others only a few times a year, if that. Fikewise, many local Marshall 
businesses and organizations have only Facebook pages and/or very brief information available through 
the local chamber of commerce website. Focating websites where accurate information might be available 
has been a challenge. 
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Facebook pages themselves are a frustrating medium to archive and explore. Because of the way they’re 
displayed, finding specific information can be difficult, and related content may be buried somewhere 
inaccessible due to scope limits. An organization that posts frequently may unintentionally bury 
information quickly under other posts. For the sake of data limits, we’re currently only crawling Facebook 
pages once a month, but that may be causing us to miss some of the information we were hoping to 
collect. Brozzler may also help make those pages a bit more accessible via Wayback. 

In the future, we’re hoping to locate and create more opportunities to empower local organizations and 
businesses to get their information out in a simple and effective format. The Marshall-Lyon County 
Library currently hosts a Social Media Breakfast every other month, which is a free event with 
discussions and presentations about effectively using social media as an organization. Flowever, as social 
media has distinct limitations, we’d like to increase access and use of more static web formats. Until that 
happens, we’re considering scanning and uploading local flyers for the sake of archiving them. A great 
number of local events and groups are advertised only on local, physical bulletin boards. 

After a catastrophic rain event last week, we’re also now planning to set up some event-specific 
collections to pull in additional sources and help users quickly locate information on those events in the 
future. 


New Brunswick Free Public Library 

New Brunswick Free Public Library has been actively crawling websites for our Archive. Our board and 
director swiftly approved a paragraph addition to our collection development policy to include our Web 
Archiving policy, which was based off of policies in East Baton Rouge and the Library of Congress. At 
first, we had a team of our library Archivist, Reference Librarian, a Spanish speaking Library Associate 
and myself, meeting to develop a list of organizations we would like to contact to seek permission to 
crawl their website. At our second meeting, and after trials and errors with successfully crawling, we 
decided that we would crawl the sites we decided were relevant and take down the sites upon request. We 
currently have 14 active collections on our page, and one that has no data due to problems saving a video 
that the Help Center said could not be done. We only have 90 GB’s remaining of our yearly budget due to 
one seed, that when initially crawled was under 2 GB’s, and then when scheduled to crawl weekly, 
jumped up to over 100 GB’s. Once that was already saved, I turned off the scan for that seed and reduced 
all of our weekly seeds to monthly to save space. 

Nearly all of our collections are for local non-profit organizations including churches, local free 
newspapers, Hispanic community organizations and the local government website. We also have the 
town’s real estate development organization and articles pertaining to their new buildings. 


Patagonia Library 


Upon cohort participant notification: 

Informed Town Council, Library Board, staff and volunteers of project participation. 
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Submitted information to two local newspapers. 

Two articles published, Patagonia Regional Times and Nogales International 

Invited community nonprofits via email, news article and word of mouth to project informational 

meeting. 

Twelve potential participants attended meeting. 

One on one meetings took place with non-attendees. 

Attended cohort meeting in San Francisco. 

One-on-one online remote training with Sylvie. 

Successfully crawled five websites. 

Unsuccessful attempts with two newspaper articles. 

Explored Metadata options. 

Rewrote collection development policy to include digital material inclusion. 

Completed three learning modules and listened to all webinars. 

Registered for Association of Rural and Small Libraries Conference, I will share project information on a 
one-to-one basis. 

In planning stages to attend Columbus meeting. 

Anecdote: Project participation led to two participating nonprofits to redesign and update their websites. 


Queens Library 

Policy: After attending the Community Webs kick-off meeting in November, we took web archiving 
collection development policies created by other institutions back to Queens Library and used them to 
draft our own. This policy went before our Chief Librarian for approval and then through our Legal 
Department for review and approval. We plan to publish the policy on our website. 

Web Archiving: This was our second year of web archiving at Queens Library. We met with our internal 
curation stakeholders. Government Affairs, Communications and Archives to give them an update on the 
seed crawling we had started for them last year and to capture any changes they would like in this year’s 
seeds in their respective collections on Archive It. The Archives Division made no changes. Government 
Affairs requested a switch from Flickr to Instagram for the feeds we’re capturing from local elected 
officials. Communications added an important new collection by sharing the monthly clippings reports 
they create for our CEO and Board members. We do one-page crawls on these news stories, which is a 
great thing to do right away, especially when capturing video before a news station takes it down, which 
can happen quickly. 

Our two-person web archiving team was able to trouble-shoot some problems we encountered with our 
crawls, and solidify their workflow. The Technical Support Aide on the team (Pablo Jarama) is the person 
who adds the new seeds from the clipping reports we get from Communications, and he starts a test crawl. 
The librarian on the team (Kate Finley) reviews these test crawls once complete and either saves them, or 
makes changes to improve them and runs them again. She found the help feature on the Archive It 
backend very useful, Internet Archives staff was knowledgeable and helpful. The new crawler they 
introduced, Brozzler, was key in capturing those Instagram seeds our Government Affairs Department 
wanted. 
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Public Outreach: Queens Library’s Head of Metadata Services (Natalie Milbrodt), Data Specialist (Kate 
Finley), and Digital Literacy Coordinator (Jeff Lambert), collaborated to develop a two-part workshop for 
graduate students earning their MLIS degrees from Queens College, CUNY. The student chapter of the 
SAA publicized and organized logistics for the workshops. We filled a computer lab (18 participants) 
with students who learned in the first session more generally about web archiving and in the second 
session used a dummy account to take a tour of our Archive It account backend. We demonstrated how to 
set up a new seed and different ways to trouble-shoot a crawl when it doesn’t work the first time. We 
shared information with them about how we curate our crawls, our costs, logistics, and how this work fits 
within our organizational mission. 

Professional Outreach: In October, Natalie co-presented in a session at the Association of Tribal 
Libraries & Museums annual meeting in Albuquerque, New Mexico. In November, Natalie presented to 
other cohort members during the kick-off meeting for Community Webs in San Francisco. In March, Kate 
Finley attended the National Forum on Ethics and Archiving the Web in New York City. In April, Natalie 
co-presented in a session at the Personal Digital Archiving conference in Houston, Texas. In June, Natalie 
co-presented in a session at the New Jersey Library Association Annual meeting. In August, Natalie will 
co-present on the Community Webs panel discussion at the Society of American Archivists meeting in 
Washington D.C. All of these presentations focused on the role public libraries can have in web archiving 
and shared the process Queens Library undertook to launch our web archiving practice. 


San Diego Public Library 

Planned Collections & Activities 

San Diego has over eighty neighborhoods rich with history with many community organizations. For our 
next collection I plan to identify community websites and create a neighborhoods collection that 
showcase the many local organizations from business improvement district organizations to neighborhood 
historical associations to town councils. 

Other topics that affect the San Diego community that we are considering include: homelessness. 

Hepatitis A, LGBTQIA, immigration and gentrification. 

We are also planning to create a web portal promoting our web archives and soliciting topics from the 
public to better reflect the interest of the community. 


In March 2018 I did a presentation about the Community Webs project at the Seguimos Creando Enlaces 
Binational Conference. There was a lot of interest in the project and I met many wonderful colleague 
from Mexico interested in learning more about the project. 


The two current collections that I’m working on include San Diego Visuals Artists and San Diego 
Breweries. 
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San Diego Visual Artists 

San Diego Public Library’s Visual Arts Program began in 1997 and was curated by Mark-Elliot Lugo 
until 2012. The program demonstrated the library's role as a cultural institution embracing a broad range 
of disciplines while helping San Diego's mid-career and older professional artists achieve wider attention. 
In 2013 SDPL opened a dedicated art gallery at the new Central Library, curated by Kara West. This 
collection of websites represents the local artists that have exhibited at the Central Library’s Art Gallery 
since 2013 or at the Pacific Beach Library from 1997 through 2012. This collection is in progress as we 
have identified over 200 San Diego visual artists that we will add to the collection. 

San Diego Breweries 

San Diego is known as America’s beer capital with over 150 craft breweries. San Diego’s craft beer craze 
got its start in the 1980s with the second wave in the mid 1990s through the talent of brewers and their 
willingness to brew what they wanted instead of the market wanted. 


Schomburg Center for Research in Black Culture, The New York Public 
Library 

The awarding of the Community Webs Grant came at pivotal time for our organization. Because the work 
of the Schomburg Center is to collect and preserve content on the global Black experience, we were faced 
with the dilemma of how to do that given the explosion of content on social media and other web 
publishing platforms documenting the lived experience of Black people from around the world. Prior to 
this grant, NYPL had not invested any significant resources of time, staff, or money to explore web 
archiving. Thanks to Community Webs, this grant has revitalized dormant conversations and encouraged 
NYPL’s staff who are web archiving advocates, that the institution might soon begin a sustainable web 
archiving program. 

Project Activities 

The first year of the grant has been more of an opportunity to test out Archive-It and it’s functionality as 
well as educate myself and my colleagues on web archiving. Including the initial partner meeting and the 
web junction modules. I’ve had the opportunity to attend to conferences discussing web archiving, submit 
conference proposals to educate others about the grant and my project, and finally host my own meetings 
with NYPL colleagues about the state of web archiving at our institution. 

• The National Forum on Ethics and Archiving hosted by Rhizome introduced me to various tools like 
web recorder, ethical issues around web archiving, as well as web archiving projects that center the 
community and the historical record. 

• The New York Technical Services Eibrarians hosted a conference Web Archiving: Issues and 
Challenges where I learned from 3 presenters on different web archiving projects at their institutions. The 
discussion provided me with some helpful insights regarding tips and challenges of starting a sustainable 
web archive program. 

• In person training with Archive-It team member Jillian Eohndorf on best practices and using Archive-It 
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• Two conference proposal accepted to discuss Community Webs and the Schomburg Web Archive. 

o Joint Conference of Librarians of Color in September 2018, the poster proposal “Hashtagged 
Syllabi: Web Archiving Race and Social Justice Education.” 

o Collaborating with Maria Praetzellis to present at the International Internet Preservation 
Council in November 2018, session titled “Community Webs: Empowering Public Eibraries to Create 
Community History Web Archives.” 

• I hosted a meeting with a digital archivist from our Archives Unit and our Digital Preservation Manager 
who are both advocates for starting a web archiving program. I was able to add them to our Archive-It 
account so they could test out the platform. They hope to use Schomburg’s Archive- It collections as 
model when they present their case to library administration as to why we urgently need to launch a web 
archiving program. 

• Schomburg so far has 3 collections in Archive It. Currently they are not available to the public: Hashtag 
Syllabus, Schomburg Eivestream Archive, and Digital Schomburg. 

Future Plans & Sustainability 

Plans to do some community outreach with specific groups that are producing content that align with the 
scope of our collection development goals. Also, after hearing about the web archiving project of New 
York Art Resources Consortium (NYARC) I want to consult with the other divisions of the Schomburg 
(Art & Artifacts, Moving Image and Recorded Sound, and Photographs and Prints) on how to approach 
web archiving related to their collecting areas. 


San Francisco Public Library 

We currently have three people working on this grant at San Francisco Public Eibrary: Mel Gooch (Main 
Eibrary 5* Floor Manager), Susan Goldstein (San Francisco City Archivist and Main Eibrary 6* Floor 
Manager), and Herman Wong (San Francisco Documents Fibrarian). In the first year of the grant we 
have focused on training and analyzing how SFPF has used Archive-It over the past 10 years. 

Susan and Mel participated in the in-person training at the Internet Archive in San Francisco in November 
2017. Susan, Mel, and Herman have all engaged with the online learning module, attended the grant 
webinars, as well as other Archive-It webinars. 

Our first project was archiving San Francisco Mayor Ed Fee’s webpages after his death in December 
2017. Herman spent several months working on this collection with support from Sylvie Rollason-Cass 
from Internet Archive. He archived a total of 23.4 GB for this collection. He was able to capture the 
sfgov.org pages as well as Mayor Fee’s Twitter account. Unfortunately there were too many technical 
issues and we were not able to archive Mayor Fee’s Medium page, Instagram feed, and Facebook page. 

In May 2018 Mel participated on a panel with other grant libraries at the Society of California Archivists 
Conference in Yosemite, CA titled “Community Webs: Empowering Public Fibrarians to Create 
Community History Web Archives.” Susan will be moderating a panel at the Society of American 
Archivists Conference in D.C. in August 2018 titled “Community Webs: Empowering Public Eibraries to 
Create Community History Web Archives.” 
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With information learned in the online course and webinars, we are currently developing a collection 
development policy for web archiving at SFPL, creating a URL collection form, and preparing to launch 
our next project which will be a collection archiving San Francisco community groups and organization’s 
Web pages. We will also be working on a separate collection development policy for archiving San 
Francisco Government pages as part of our San Francisco Documents collection. 

In the next few months we will be bringing on additional staff to support web archiving at SFPL. We will 
work together to identify opportunities for programming and outreach to help us create additional 
collections. 


The Urbana Free Library 

At The Urbana Free Library, the initial year of the Community Webs project was spent developing our 
collection development goals and strategies for the web archiving of Champaign County, Illinois. A key 
step toward developing theses goals, as with any form of collection development at my repository is, how 
can we build relationships, and engage those who we want to document, as active participants in the 
documentation process. To assist my institution with moving forward with this step of the Community 
Webs project, I took advantage of my role of archival educator at the University of Illinois at Urbana- 
Champaign, School of Information Sciences, and taught an eight-week course in the spring 2018 semester 
on Community Archives. The course provided 10 graduate students with an introduction to key concepts 
and issues associated with community archives, but with a special focus on introducing them to web 
archiving, and the importance of preserving online local records in the age of the web. 

Throughout the semester, students played an active role in working with organizations that I had 
previously selected for inclusion in our library’s Community Webs initiative, and that represent 
historically significant institutions and anchors in our communities and neighborhoods. Students were 
introduced to the joys and challenges of working with our local community’s stake holders, by building 
relationships and working together in assessing their documentation. Students learned how the 
preservation of web-based content will complement and supplement our library’s existing documentation 
activities, by preserving web-based materials created by the individuals and organizations we already 
actively document. But also how preserving web-content enhances our ability to document those hidden 
voices that are under-represented in our archives. 

Each student selected an organization for their community webs project at the start of the semester, and I 
set up test crawls for each of the sites. Next, students completed an assessment of the existing archival 
documentation of the organization by researching the holdings of the Champaign County Historical 
Archives at The Urbana Free Library, and other area archives and museums. Next, students and I made 
contact with each organization and set up meetings to discuss their archival documentation, web presence, 
and the Community Webs initiative. 

After reviewing the existing archival documentation, web presence, and also insight from each of the 
organizations about how they use and value their online presence, students were provided with a Website 
Appraisal Checklist, to use as a tool to both determine the archival or permanent values of the web 
presence of their assigned organization, and to evaluate the quality of the test crawls. Finally, students 
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were provided with a Finding Aid Template, and using archival descriptive standards, drafted a finding 
aid for the organization’s Community Webs collection. At their final presentation, students presented 
what they learned about the organization, an evaluation of its web presence, addressing relevant portions 
of the finding aid template and the web-appraisal checklist. 

I hope that the procedures developed over this past year can serve as guide to further develop The Urbana 
Free Library’s Community Webs project in the coming years. Some important lessons learned so far are 
that developing relationships with community partners is important, insightful, and time- consuming. The 
web presence of community organizations is multi-faceted. While websites continue to represent an 
important facet of a community’s online presence, social media sites provide many organizations, 
especially non-profits that have an activist mission, the tools to engage with their community that makes 
preserving those sites essential, if we are to capture the true essence of their work and impact. 


West Hartford Public Library 

Since November, when I ran my first crawl, I have spent time learning to use the Archive It software 
running test crawls, saved crawls, deleting crawls and finally have reached a point where I am crawling 
certain sites on regular basis. I am also beginning to use the Q&A to assess the success of my crawls. 

I have created 4 collections for a total of 49.7 G of archived data. I have also engaged with Archive It 
staff as they help me understand my results and how to refine my crawls to obtain the information that I 
desire. The four collections are: 

West Hartford Public Library - to capture the library’s institutional history with a special emphasis on the 
Memories of Migration initiative. 

Town of West Flartford Collection - to act as a readily accessible archive for town projects, events, 
schools. Obtained permission from Town IT department before crawling. 

Regional Projects Affecting West Hartford - to track the major highway project of I84E/W as 2 miles of 
the aging highway are replaced. Spoke with State of Library staff who from the Access Services Unit 
because they work with state agencies and are developing a project to crawl and harvest state websites. 
Discovered that it was acceptable to crawl this public site. 

Town of West Flartford Online News - to provide a more comprehensive picture of the town. Broader 
coverage of business, town events, nonprofits organizations, and thorough articles on actions taken by the 
Town Council and Board of Education. Initially, I asked permission to crawl only the business category 
but the publisher offered the entire online newspaper. 

I have begun to add metadata to all my collections. Having never used metadata, I now view metadata as 
a key component in creating access points to my collections. 

At the library, I have begun discussions with staff regarding how to create an online collection 
development policy that is in sync with the library’s existing materials selection policy and the Eocal 
History collection development policy. 
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Athens Public Library 1 

Brooklyn Public Library 5 

Buffalo & Erie County PubUc Library 6 

Columbus Metropolitan Library 6 

DC Public Library 8 

Denver Public Library 8 

Henderson District Public Libraries 9 

Kansas City Public Library 9 

Lawrence Public Library 11 

Pollard Memorial Library 12 

Westborough Public Library 13 


Athens Public Library 

—Test crawls— 

—25 January— 

543964: Seed: https://www.facebook.com/ACCLHeritageRoom/ 

Blocked by robots.txt. Deleted crawl. 

543969: Seed: http://athenslibrarv.org/athens/ 

Standard crawl; insufficient amount of pages/ links downloaded. 

—7 February— 

548095: Seed: http://athenslibrarv.org/athens/ 

Changed to Standard+ to see if more of the material linked-to gets crawled. The results were better but the 
following sections of the parent web site{http://athenslibrary.org) need to be crawled to capture all of 
this seed:http://www.athenslibrary.org/support-the-library/;http://www.athenslibrary.org/services/; in 
addition to the newsletters stored on libraryaware.com, the Heritage Room’s blogs (Keeping Athens 
Historical and This Day in Athens), and the library’s YouTube page/ videos. Besides the events calendars 
(that is, the Adult Events, Teen Events, Children’s Events pages) not being crawled in either the Standard 
or Standard + versions, the Events and Exhibits page within the Heritage Room section of the suggest that 
we would need to crawl the site monthly. Both this crawl and the previous (543969) have some 
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mysterious hosts showing up. In this one, kaltura.akamaized.net took up more than 29 GB out of 80 total. 
The previous erawl did not eonsume sueh an exeess of data but did show more hosts that eontained Out of 
Seope eontent. Either way, further test crawls will be necessary or we could consider splitting this seed 
into several. 


—15 February— 

550116: Seed: https://keepathenshistorical.podbean.com/ 

Standard. No problems. Also now set as a monthly active seed. 

—23 February— 

551985: Seed: https://accheritage.blogspot.com/ 

Standard+. This approach does seem to capture all of the blog (as compared to the Standard method, as 
seen in the blog crawled for the Rendezvous collection below). It takes up a large amount of data, though, 
so we may need to include this with the videos on the library’s web site in the “maybe” pile. 

Rendezvous With Destiny : 

—Test crawls— 

—10 January 2018— 

539701: Seeds: 

https://www.thepurpleheart.com/roll-of-honor/proFile/default?rID=f47a9594-ee83-47eb-b2b3-26a3ceae47 

de 

This site was mistakenly included; the person in question is not the same John M Atherton. Crawl deleted. 

539696: Seeds: http://elizabethprata.blogspot.eom/2008/09/mr-iere-avers.html 
Same as the regular crawl noted below. 


—24 January 2018— 

543527: Seeds: http://www.legacv.com/obituaries/onlineathens/obituarv.aspx?pid=l71222324 

This Standard crawl got all that we need; probably should be One Page crawl. Another test crawl will be 
done to compare the two versions. 

543535: Seeds: http://www.atlantahistorvcenter.com/research/veterans-historv-proiect/ 

https://www.voutube.com/plavlist?list=PFHODIJfoHwXMkA4eAo6S365KglrORctc3 
Too much data, probably; stopped while in progress and re-done according to Archive-It guidelines about 
You Tube playlists—see below. 

543589: Seeds: https://kaltura.uga.edu/media/t/1 f33m3dhd/ 

Archive-It suggested adding the “/” to the end of this URF. Video does not load in the archived version. 
http://www.grhof com/09%20H%20Randolph%20Holder.htm 
Entire site downloaded. Going to see if this works better as a One Page+ crawl. 
http://hmfa.libs.uga. edu/hmfa/view?docId=ead/ms3192-ead.xml 

Make One Page crawl, though it does not seem that this change will result in different material being 
crawled because the links from the page in the Standard version do not work anyway. 
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543622: Seeds: 

http://www.cobbcat.org/recording-and-saving-veterans-stories-before-thevre-lost/ 

Blocked by robots, txt, do not crawl for now. 

543634: Seeds: 

https://www.voutube.com/plavlist?list=PLHODIJfoHwXMkA4eAo6S365KglrORctc3 
Seems to have captured all of the videos, though when one goes to the page for each video, the videos 
linked-to from those pages also seem to have been captured, though they do not pertain to this playlist 
(e.g. advertisements, spam, random videos). Going to try this as a One Page only. 

543635: Seeds: http://www.atlantahistorvcenter.com/research/veterans-historv-proiect/ 

Video embedded on page initially did not show up on archived version—only a note about not having the 
latest version of Flash installed. On a different computer, however, it worked. Videos linked-to from this 
page are not being capture, so we will try it as a Standard+. 


—25 January 2018— 

543906: Seed: http://www.legacv.com/obituaries/onlineathens/obituarv.aspx?pid=l71222324 

One Page version. I do not see any difference between this crawl and the earlier Standard version. These 

obit pages would thus seem ready to save to the permanent collection via One Page crawls. 

543925: Seed: http://www.grhofcom/09%20FI%20Randolph%20Holder.htm 

One Page+ version. I cannot find any difference between this crawl and the earlier Standard version. They 
both capture more pages from the site than we need. 


—3 February 2018— 

546967: Seed: https://www.voutube.com/plavlist?list=PLF[ODIJfoHwXMkA4eAo6S365KglrORctc3 
One Page version. This version may have captured less superfluous material than the One Page+ 
version—at least the amount of data suggests that. Flowever, the amount of data downloaded seems as 
excessively low as in the previous crawl it was excessively high. Perhaps stick with the recommended 
approach of One Page+ for You Tube playlists, but more test results are needed, because if this playlist 
actually does take up 150-200 gigabytes, we will have to exclude some Athens Library-related videos 
from our collection. 


—7 February 2018— 

548055: Seeds: http://hmfa.libs.uga.edu/hmfa/view?docId=ead/ms3192-ead.xml 

One Page offers same results as Standard; if these Flargrett finding aids are going to be added to our 

collection, then do them as One Page. 

http://www.grhofcom/09%20FI%20Randolph%20F[older.htm 

Same as One Page+ and Standard: the entire site is being downloaded, likely because it’s basic FITML; 
probably not a problem; crawl as One Page. 

http://whitecountvhistoricalsocietv.com/research/general-research-index/wwii-b 

One Page didn’t work here b/c it didn’t capture the images that we want; going to test as One Page+. 

https://www.legacv.com/obituaries/atlanta/obituarv.aspx?pid=153182160 

One Page for these “atlanta” variants of the legacy.com obits, like the other legacy.com pages, seems to 
work as well as Standard. Need to do this and the Radio Flail of Fame pages separately to save them. 
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—15 February 2018— 

550047: Seed: http://whitecountvhistoricalsocietv.com/research/general-research-index/wwii-b/ 

One Page+ version. Still not getting the images, without which there is no point of including this site. 
Going to try as Standard. 

550082: Seed: https://kaltura.uga.edu/media/t/1 _i33m3dhd/ 

One Page version. No difference, except that, returning to the archived page later, an error message 
appears instead of the video. A different, but similar problem happened previously. Further research into 
potential sites suggests that the videos at this site are probably well preserved by UGA institutions. 

550119: Seed: 

http://www.hightowerfuneralhome.com/obituaries/William-Eugene-Gene-Cook?obId=l 348788# 

Another obit variant; One Page. No problem with this crawl. 


550129: Seed: https://www.voutube.com/plavlist?list=PLFIODIJfoFIwXMkA4eAo6S365KglrORctc3 
Previous two YouTube crawls deleted to get a better sense of how much data these videos would take up. 
Forgot to delete crawl that included both this playlist and the Atlanta Flistory Center page, so the result 
was comparable to the One Page crawl previously done. Re-doing below. 

—16 February 2018— 

550325: Seed: https://www.voutube.com/plavlist?list=PLFIODIJfoHwXMkA4eAo6S365KglrORctc3 
Confirmed with this crawl that a One Page will not work for a You Tube playlist. I will do a final test of 
this page as a One Page+ (and following the other recommendations that Archive-It gives for YouTube 
playlists) to confirm the amount of data that these videos will take up. 


—22 Feruary 2018— 

551670: Seed: https://www.voutube.com/plavlist71istoPLHODIJfoFIwXMkA4eAo6S365KglrORctc3 
One Page+. Something weird happened with this craw. Though the main page was downloaded, the links 
to the videos do not work and the URL starts with the following: “s.ytimg.com”. Until we get feedback 
about how much space these videos are taking up (more than 200 GB this time, with more than 100 of 
that being “new data,” we should not crawl this site. 

551708: Seed: http://whitecountvhistoricalsocietv.com/research/general-research-index/wwii-b/ 
Standard. This time, captured the images. Good to go. 

551720: Seed: http://www.todavingeorgiahistorv.org/tags/world-war-ii 

One Page+ crawl did not capture the videos on the pages that are linked-to from the seed. Standard might 
capture them, but given the potential size of these videos they have to be excluded. 

—23 February 2018— 

552019: Seed: http://www.atlantahistorvcenter.com/research/veterans-historv-proiect/ 

Standard+ version. Some of the pages/ videos linked-to are still a little iffy. Might need to consider 
whether this is within the scope of our collection. 
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552032: Seed: http://www.grhof.com/09%20H%20Randolph%20Holder.htm 
Crawled separately so that it can be saved. 

^ ^ ^ ^ ^ ^ 

In addition to the saved test crawls (highlighted above) the obituaries listed in the potential-crawls 
document have been checked again to see that the links work and to confirm (as far as I can tell) that they 
match the persons interviewed, so that they can be all be crawled once (that is, we’re assuming that the 
content will not change). 

***10 January 2018*** 

539698: Seeds: https://ss8h9.wordpress.com/ 

No problems with this crawl. 

http://elizabethprata.blogspot.eom/2008/09/mr-iere-avers.html 

For future crawls with blogspot.com URLs, we might want to limit the crawl to a greater extent than we 
did here—that is, it probably should have been a One Page crawl; that said, a Standard crawl like this one 
does not capture the blog’s entire contents, only the other blog posts from the same month that the 
crawled page is from, in this case September 2008; arguably, a glimpse of the rest of the blog’s contents 
provides context for the page that pertains to the collection—in this case, that argument’s quite a stretch 
given the content of most of the blog, which is why it probably should have been a One Page. 

Brooklyn Public Library 

Here’s a quick rundown of what I’ve done so far: 

-Canvassed BPL staff and collected their suggestions for sites to collect and ways to shape the web 
archive 

-Started building a list of sites and ideas for collections 
-Created collections in Archive-It and started test crawls 

-Started reaching out to some groups whose websites we are archiving—^but not having much luck with 
that so far (only one group has gotten back to us) 

-Hosted a panel discussion on “Saving Local News on the Web” (I’m sure you saw the write-up on the 
Archive-It blog) 

-Presented at the Urban Librarians Unite Conference with fellow cohort member Jacquelyn Oshman 
-After a lot of dithering, starting actually saving some crawls! (Still not feeling 100% confident about my 
ability to assess a crawl. I just saved one, only to notice almost all of the data was video that won’t play 
for some reason.. .might need help with that one). 

-Made my collections and crawls public, and started adding metadata. 

Coming up: 

-Presenting at the NJLA conference with Jacquelyn and Natalie Milbrodt (June 1st) 

-Presenting at SAA with a panel of cohort members (August) 

-Continuing to build our collections and add metadata 
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-At NEA I had the idea of a get-together for NYC-area web archivers (potentially to form a 
cohort/coalition/community) and Karl said he would try to pull it together, so we’ll see! 

Goals: 

-Continue to build partnerships within BPL. Especially hoping to get more buy-in from BPE so we can 
continue to fund this project after Community Webs has finished (this is a focus of mine for year two). 
-Continue reaching out to organizations whose sites we are saving to notify them/hopefully build 
partnerships and relationships. 

-Host another event or two to build awareness in the community 
Ideas: 

Maybe the cohort could collaborate on a toolkit? “Web Archiving for Public Eibraries,” or something 
along those lines? Just an idea my boss and I had as I try to remember to document my own successes, 
failures, and lessons through this process. Or maybe it would make sense for cohort members with similar 
focuses (like we are focused on local history, for example) to collaborate on toolkits for their specific 
focus areas. 

Buffalo & Erie County Public Library 

Buffalo & Erie County Public Eibrary has been focusing attention on the rebuilding of Buffalo and the 
surrounding areas. Currently Buffalo is undergoing a re-birth of buildings, business, and culture. We have 
been focusing our attention for the grant on documenting all of the exciting changes in our area. Our 
collections are: Government, Education, Refugee/Immigrant Population, Festivals and Fun, Infrastructure, 
Education, and the Eibrary System. So much is happening in our area at a grassroots level that we want to 
make sure their activities are preserved as they might not have the means to do it on their own. The 
Community Webs grant has allowed our Eibrary to be able to capture a snapshot of time for preservation. 

Columbus Metropolitan Library 

Activities 

Since October 1, 2017 Columbus Metropolitan Eibrary (CME) has archived 303 GB of data through the 
Internet Archive using the Archive-It web archiving application. CME currently has two collections in the 
Internet Archive: 

1) Columbus Websites; which is made up of government and philanthropic organizations in Columbus 

2) Immigrant Experience, which is a collection of websites devoted to providing resources and telling 
the story of recent immigrants to Columbus. 

Currently CME has 8 active seeds, crawling 8 websites monthly that have produced over 6 million 
documents that have been preserved, and made accessible through the Wayback Machine. 

Current seeds in Columbus Websites: 

http://columbuslandmarks.org/ 

Advocating, promoting, preserving Columbus Ohio's historic architecture and honoring excellence in 
urban design. 

https://columbusfoundation.org/ 
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The Columbus Foundation is the trusted philanthropic advisor to 2500 individuals, families, and 

businesses who have created charitable funds. 

https://www.columbus.gov/ 

City information, departments, events, reports, and elected officials. 

Current seeds in Immigrant Experience: 

http://ohiohispaniccoalition.org/ 

The mission of the Ohio Flispanic Coalition (OFICO) is to improve the well-being and quality of life for 
all Flispanics/Latinos through advocacy, education, training, and access to quality services. 
http://www.crisohio.org/ 

Community Refugee & Immigration Services (CRIS) is an independent non-profit organization that 
serves the growing refugee and immigrant populations in Central Ohio. 
http://www.somaliohio.org/ 

The Somali Community Association of Ohio (SCAO) is an officially recognized 501 (c)(3) non-profit 
organization founded by Somali community immigrants in 1996. SCAO was the first grass-roots Somali 
organization in Ohio, and the first to provide services to the Somali community. 
https://www.bnccohio.org/ 

The Bhutanese Nepali Community of Columbus (BNCC) is a community-based non-profit organization. 
It is a 501(c)(3). BNCC is dedicated to helping newly resettled Bhutanese refugee population from Nepal. 
https://u.osu.edu/dimelocolumbus/ 

jDimelo, Columbus! Is a collaboration between Ohio State’s Office of Diversity and Inclusion’s ,[,Que 
Pasa OSU? and WOSU’s Columbus Neighborhoods produced by Leticia Rose Wiggins. Ohio’s Latino 
population is increasing - with Franklin County experiencing the most significant growth during the past 
decade. While the Census Bureau documents this expanding community using the blanket term 
“Hispanic,” the word itself accounts for a myriad of identities such as Chicano, Colombian, Cuban, 
Mexican, Mexican American, and Puerto Rican. 

Going forward we have discussed adding a Genealogy collection where we would archive important sites 
in Central Ohio that are important to genealogists. In particular, funeral home websites are becoming 
increasingly important to archive since obituaries are now stored digitally, often replacing traditional 
obituaries that were once printed in the newspaper. Also of interest is to archive local genealogy websites 
that publish their program information and newsletters only online. As we become an increasingly 
paperless society, expansion of projects like Archive-It will become more important for archiving and 
accessibility of bom digital material. 


DC Public Library 

To date, DC Public Library has created seven collections as part of the Community Webs program: DCist 
Archives; Neighborhood Blogs of Washington, DC; Comics Publishers, Artists, and Authors of 
Washington, DC: DC Punk Web Archive; Advisory Neighborhood Commissions of Washington, DC; 
Ethiopian-American Culture in Washington, DC; and the in-development Interactive Fiction Web 
Archive. In order to reach our goal of seed-level metadata for all collections, a pilot program to involve 
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branch librarians in metadata creation was successfully enacted and will be expanded in the near future. 
Other milestones reached include the approval of a collection development policy for web materials and 
the creation of a suggestion box mechanism for both website owners and the general public to expand the 
voices involved in the selection process. 


Denver Public Library 

At Denver Public Library, we have been gaining some confidence about identifying sites to crawl and 
making a commitment to crawl them regularly. I am still the only staff person working on this project, 
though I have received interest and some feedback from my colleagues. For the first few months of the 
project, I was reluctant to schedule crawls until we had a sort of master collecting plan. Now, half a year 
into the project, it is getting easier to view collecting web content as merely an extension of our physical 
collecting. We make our best effort to collect what we think researchers in the future will find important, 
and we sometimes nail it, and probably sometimes miss it - our Archive-It content will probably be no 
different. 

Initial collecting areas to focus on for Archive-It we identified are: 

Archive-It collections structured to mirror our own library collections 
Neighborhood newspapers - those that have a print presence or just an online presence 
Local events websites - festivals, concerts unique to Denver and its neighborhoods 
Local political races 

Library website, staff-generated content and blogs (still have had trouble capturing all blogs, in a couple 
test crawls we got inconsistent results) 

Sites of partner organizations 

Some websites of organizations whose papers we collect (there are thousands of crawls to consider here, 
so I will use my best judgment to identify those that are most important and research-valuable) 

Questions that have surfaced about the possibilities of collecting web content: 

Could local newspaper sites on Wayback replace our need to collect local neighborhood 
newspapers/newsletters? 

Could adding an occasional crawl of organizations’ websites be part of our terms of an accession plan or 
even a Deed of Gift for some collections? 

Could adding a thorough one-time crawl of an organization’s web content be a means to replace some 
extent of a physical manuscript donation? 

How can we help in-house researchers begin to think of Wayback as a necessary research tool? It is not 
on most casual researchers’ radar yet. 

Henderson District Public Libraries 


Henderson Libraries has created five active collections as of April 2018. 

- Henderson Libraries Web Archive includes the Libraries’ social media accounts (mainly Facebook, 
Twitter, and Instagram), press related to the Libraries, and the Libraries’ website 
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- Henderson City Council includes press related to the City of Henderson, elections, and the State of the 
Union address on YouTube 

- Remembering 1 October includes news articles on the event, survivor stories, memorials for victims, 
and resources for community members that want to help 

- March On with Mayor Debra March solely focuses on capturing video recordings of the monthly 
community event series highlighting a different theme or topic of interest to Henderson residents 

- Henderson Community Developmen t includes press related to any redevelopment in Henderson, 
Henderson Strong website (a citywide planning document that communicates the vision, long-term goals 
and objectives for Henderson), and the Community Development website 

Currently, we are working to engage the Henderson Libraries’ staff with the project before we aim our 
efforts at community engagement. We want the staff to feel comfortable enough on the subject before we 
launch any initiatives. In early March, we assembled a committee that includes City of Henderson 
employees including the Assistant City Manager and Redevelopment Manager, Archives and Records 
Manager, City Clerk, Public Engagement Specialist, and the Director of Public Works, Parks and 
Recreation. This committee will help bolster the projects publicity through partnerships within the 
community and will create more collection ideas. 

We’re hosting a Community Scanning Day on April 28 and we’re passing out general brochures on the 
project to attendees. In the future, we want our community engagement to include a way for the public to 
nominate seeds. 

We hope to have a complete collection development policy in regards to the web archive sometime in the 
next few months. 

Henderson Libraries participated in a panel discussion at the Society of California Archivist’s general 
meeting on April 12 in Yosemite National Park along with cohort members from San Francisco Public 
Library, Sonoma County Public Library, and County of Los Angeles Public Library. We also participated 
in a THATcamp workshop with web archivist Sylvie Rollason-Cass at the National Council on Public 
History on April 18 in Las Vegas, Nevada. 

Kansas City Public Library 

The Kansas City Public Library has been an active and engaged participant in the Community Webs 
program as a member of the expanded cohort since Fall of 2017. Our work has been threefold: 

1. Educational activities around web archiving and the Archive-It software 

2. Creating collections, running crawls and fine tuning results 

3. Engaging staff and the community around websites and topics to include 

I have participated in the three webinars for the cohort and taken the associated course sections on the 
WebJunction site. I presented about our plans for community and staff engagement during the module 3 
meeting. I also attended the November in-person meeting at the Internet Archive. 

I began running crawls as early as October 2017 in order to test my working knowledge of Archive-It. 
This turned out to be much more hands-on and time consuming than I anticipated as almost every site 
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required multiple test crawls to avoid crawl traps and other unwanted content. At some point I reached a 
happy medium between creating the perfect crawl and the amount of time I was spending on each seed. I 
still routinely start with test crawls hut have dramatically lessened the time it takes to come to a 
successfully captured site. 

I engaged a small working group of library staff to help work through the focus of our web archiving 
efforts. As a group we took a community-focused view of web archiving rather than one that dovetails 
neatly with existing collecting practices in special collections. We developed some simple guidelines and 
collecting principles based on the following: 

Prerequisites for Inclusion 

1. Ownership and management of the website is based in the Kansas City metropolitan area. 

2. Site documents the government, culture, everyday life, politics or business climate of Kansas City. 

3. Site offers utility for future research and understanding of this era in Kansas City history. 

Factors Working Against Inclusion 

1. Content is duplicated in other locations or other formats 

2. Archiving responsibilities more appropriately lie with the owning organization or another entity 


Factors Working in Favor of Inclusion 

1. Site documents stories, news and activities within a particular neighborhood or section of the City, 
especially the urban core. 

2. Site showcases unmediated personal viewpoints of Kansas City residents. 

3. Longevity - length of time site has been operating 

4. The site has an identifiable following or is a recognized authority or expert 

5. Community Impact - site has affected change 

6. Site is at risk of being lost due to neglect, finances, technology, age or ownership/management. 

7. Site documents stories, news and activities of a particular identity group, ideology or community 
of practice likely to be overlooked by traditional media entities. 

Mission 

The Kansas City Public Library Community Web Archive provides ongoing access to content published 
on the Web that documents the activities of everyday Kansas City residents. The Archive focuses on 
at-risk content with a neighborhood-level orientation. 

Goals 

The goals of the KCPL Web Archive is to establish an ongoing, collaborative program of assessment and 
curation of web content that provides value to Kansas City residents, researchers, historians and students. 

Archive-It Collections 
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1. Hyper-local Media 

2. Politics & Activism 

3. Arts & Culture 

4. Business/Entrepreneurship 

5. Neighborhoods 

6. Personal Blogs 

7. Social Media 

As of this writing, we have saved 41.1GB of data for 46 seeds. We have identified 141 total sites so far 
that will be included in the archive. We have begun the process of soliciting nominations from staff and 
the public for other sites that should be included. This is taking place through a web form and in-person at 
library events and community meetings. 

The remainder of this calendar year will be spent crawling in earnest at a larger scale, and processing 
feedback from the public. As the archive grows in scope and scale in year two, our plans are to engage the 
local media, target specific organizations whose sites are being crawled, and to develop some 
programming around web archiving. 


Lawrence Public Library 

We're currently collecting within three collections: Local Government, Lawrence Public Library, and Arts 
and Culture. The holdings in the Local Government collection focus on the website of the City of 
Lawrence, Kansas, and in particular the agendas and minutes of the City Commission. We've also 
collected the available websites of the candidates who ran for City Commission seats last November. 
Regarding the Lawrence Public Library collection, our library recently migrated to a new website, so this 
collection documents our website prior to the migration. The holdings in the Arts and Culture collection 
are currently minimal, but reflect the steepest learning curve. We worked with Internet Archive staff to 
find a solution to archiving a series of multimedia resources documenting local historic homes. Solution 
now in hand, we'll begin archiving a selection of those resources. 

We've also been working on a collection development policy for digital local history collecting in general, 
with web archiving included as a subset. 

This summer, we'll host an intern whose project will involve planning and promoting a community 
digitization event, as well as gathering community input regarding the web resources that reflect our 
community. 
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Pollard Memorial Library 

Since the cohort meeting, we have been working on creating a collection development policy for our 
library and community website. It became obvious to us that it would easy to collect too many sites and 
use our allotment quickly. We evaluated various sites that the City Library for Lowell should maintain for 
access by our patrons. The policy is in its final draft format and will be presented to the trustees of the 
library in the fall for approval. 

During the winter, we experienced a broken pipe/heating system, which delayed our focus on this project 
for nearly two months while we were in preservation and clean-up mode. This destruction and subsequent 
clean-up made it apparent the need to capture community events of relevance. A capture was done on 
coverage for the library flood from various sites to see how it displayed, worked, and captured community 
information. 

Recently, we have been running crawls and other tests for sites that we have identified as the primary 
layer of sites to collect. Members of the staff have been meeting as part of an archives community group 
of various local organizations to determine what the primary goals of preservation for the community are 
and figuring out what we can capture to assist in the digital environment. We are creating a listing of 
partner suggested sites to consider adding to the Community Webs progress going forward and also 
looking at local issues, festivals, celebrations, and discussions to consider capturing. We are doing 
research to determine which items are already captured via other media or archival programs. 

We have begun changing certain crawls to “public” but are hoping to test our process for a large scale 
community event, our Folk Festival, which is going to occur at the end of July. 

Our marketing and promotion schedule for this archive will start in September, with encouragement from 
the community to make recommendations for the capturing of community-focused websites. It is our 
expectation to have both a passive submission form as well as reach out to talk about this opportunity as 
well as sharing it out via social media, our webpage, e-news and through media outlets during American 
Archives Month in October. Our library’s website is currently under development for a new platform, so 
it’s our expectation that we will present this on a new page just highlighting community webs and 
encouraging feedback and suggestions. If possible, we will also create a page linking to current webpages 
already captured by the wayback machine in order to eliminate those suggestions and allow patrons to 
explore past iterations of those pages. 


Westborough Public Library 

Here are the collections that I have created: 

-Westborough Celebrations and Events - This collection includes website and Facebook activity related to 
our town's 300th anniversary celebration last year. I only have a small collection of physical documents 
and objects that I was able to pull together throughout that year, so this Archive-It collection captures and 
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preserves much of the digital data that most assuredly would have disappeared had I not grabbed it now. 
We have other physical collections relating to past Westborough celebrations, so this collection will allow 
us to continue to build on this theme. 

-Westborough Sports -1 created this category to capture the founding of the cricket team that was newly 
created last year, and then threw in a couple other websites relating to other sporting leagues as test sites. 
The cricket team, however, is the most important one to collect right now, because it serves as a sign of 
our changing demographics in town, which includes a fast-growing South Asian population. 

-Westborough Town Government - Our physical collections include town records going back to the 
eighteenth century, so I thought that capturing the digital manifestation of our town government as 
presented to the people was important. 

-Westborough News - This collection is actually empty as of now. The captures were huge, so I need to 
think long and hard about whether or not it is worth devoting space to this category. But as I am writing 
this, I had thought that I had continued to capture obituaries in newspapers, since I get that question a lot, 
but apparently I never saved my test crawls or scheduled regular crawls. So I created another set of test 
crawls for obituaries, and if they are not huge, then I will schedule them for regular collection. 

These collections, in the end, are still experimental, because I want to incorporate some of the reaching 
out to the community that you are advocating before solidifying and adding more. A few circumstances 
have delayed this process, though. In planning out our collection development approach, I wanted to 
include both our library director and our IT person, but before I could put together this "committee" our 
IT person retired, so I had to wait for us to hire a new one and for her to settle into her job before bringing 
her into this discussion. Once we were able to meet, we decided that I should review our current 
collection development policy and see if it can serve as a guide for the collection of websites, and then 
from there dole out "collecting responsibilities" for identifying websites to each of us. 

In reviewing our collection development policy, while I believed that it could serve as a guide for 
collecting websites simply by adding "websites" to the formats that we collect, I began to wonder if our 
traditional approach to collecting in general is too "old school." My room looks pretty "19th century," and 
while I want to be mindful of building continuity between past and present, are we at a critical moment 
where a paradigm shift is in order? I had just read an op-ed by Thomas Friedman in the NYTimes where 
he put forward the idea that we are only in the second inning of the Digital Revolution, and news about 
Facebook was just beginning to dominate the airways. Is the local history collecting strategy that was 
developed during a time of rapid changes coming out of the Industrial Revolution still appropriate for our 
time? Do we need to reconceive what a Local History Program is, and rethink what and HOW we collect? 
The answer is obvious. 

Our committee did not see the value of creating an advisory board for collecting websites. Our town is too 
small for such an entity, and it would necessarily slow down our ability to collect (with perhaps the fact 
that we are a small town, where a few people can have strong ideas and influence over how things should 
be, being one of the ways that an advisory board could prove to be more detrimental than helpful). But we 
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still want to involve our community in deciding what we want our Local History Program to be. Can we 
create programs where we can solicit this feedback, but make it interesting and beneficial to the people 
who participate? 

It turns out that our library is on the verge of going through a whole re-branding exercise. We have hired a 
consultant to help us with this process, and it is an opportunity for us to engage with the community, point 
the library in a new direction that will better benefit all of us, and then communicate this new vision in a 
way that resonates with our community. Local History will be a part of this process, and we will use it as 
an opportunity to set it on a new "collecting" path. All of this is the good news. The bad news is that it 
delays me being able to begin starting seeds, collecting websites, and adding metadata even more. So 
while my Archive-It account seems rather sparse at the moment, it does not reflect the profound changes 
that are happening here in my library. At least not yet! 

In the midst of all of this thinking, the Archive-It part suddenly seems very small. We are no longer 
coming up with a collecting strategy for websites, but reconceiving why we collect, how we collect, and 
what we collect. Archive-It is only a part of this whole strategy. Then again, without it as a component or 
collecting tool, we can't even begin to start asking these questions. 
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Birnninghann Public Library 

During the first year of our Community Webs project, we familiarized ourselves with the Archive-It 
software, participated in online training, began building our collections, and met face to face with all the 
cohorts at the Internet Archive’s headquarters. We focused on collecting and saving BPL’s websites, the 
city government’s websites, and as many of the public and private school sites located within the county. 

Our second year has been spent creating metadata for each seed as well as continuing to crawl and 
preserve websites. We are fortunate to have a retired cataloger who comes in weekly to create metadata 
for our project. Also, and of utmost importance this second year has been spent locating and contacting 
liaisons within the different communities to partner with us in saving their websites and social media. We 
realize that our archiving project will only thrive when we include community members as part of the 
process. 

As an example of community members participating, let me share with you one example of a community 
we worked with successfully. That is Alabama’s LGBTQ community. Our archives department was 
already collecting in this area and we wanted to make sure we collected digitally bom data too. We 
already had an established relationship with The Invisible Histories Project to be a repository for local 
LGBQT materials. This organization provides a connection between Queer communities and local 
repositories for the preservation of the history of LGBTQ life throughout the American South. With their 
staff, we were able to not only locate obscure websites covering the LGBQT in Alabama but were also 
able to locate contacts for sites that had already been taken offline. Plans are to connect and see if backup 
copies of these were preserved and if we can gain access to them. 

Our archivists and I participated in the first annual Queer History South Conference where we shared 
information about our Community Webs project and our efforts to preserve website and social media sites 
from the LGBQT community. We believe this exposure will help us to locate more sites that have yet to 
be collected. 

These past two years through our community webs project, we have identified 200 urls and captured and 
saved 923 GB of data and over 12 million documents. 

There have been challenges along the way. The web does remain “a mess” and some sites are difficult to 
crawl and capture. Sites created using Wix have proven to be the most fhistrating for us. Archive-IT staff 
has worked with us to make these sites easier to capture with scoping tools but it’s still a work in 
progress. 

These bom-digital collections complement and supplement our existing physical collections and our plan 
for the next 3 years is to continue reaching out to the public and asking for help identifying online sites 
that should be preserved. We plan to look into the multiple ways we can integrate the data into our 
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databases and websites. We’re discussing the use of landing pages, links, search boxes, and OAI to make 
the Archive-It metadata available for harvesting into our online catalog. 

It has been a privilege and an honor to work with the staff at the Internet Archive and with our public 
library cohorts from across the country these past two years. We hope our participation will be of benefit 
to the public library community. 


Cleveland Public Library 


lA sponsorship of Internet Archiving for Cleveland Public Library has been a success. We have 
established and expanded our Internet Archiving activities including establishing web archives of at-risk 
sites, web archives focused on topical subjects, and we archives that captured sites in line with our 
collection development policy. 

Funds were used for: 

Travel to and housing for staff for lA meetings 
Travel to meetings in Chicago, Columbus, and Texas 

Sponsorship for staff development activities with NEWDL, a NorthEast Ohio Group 

There are remaining funds. 

We will be using remaining funds to either 

A) Do a staff development course with NetEab 

B) Establish a workstation here in the library populated with web archiving software tools for training and 
development of staff 

C) Enroll staff in an SAA webinar on Web Archiving 

D) Teen outreach through partnering with local high schools 


East Baton Rouge Public Library 

Overview: 

The following provides details on activities conducted in the second year of the Community Webs project 
and plans for the future. 

27 Collections 

316.1 GB archived this year 

Collection Development Policy 
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In 2019, Special Collections spent time refining their collection development policy in order to fine-tune 
their collecting practices regarding web archives. The new version of the policy addresses appraisal and 
re-appraisal, rights and robots.txt rules, and selection criteria. 

East Baton Rouse Parish Web Archives Collection Development Policy v.3 

Outreach: 

Special Collections has made efforts to reach out to both internal and external stakeholders in 2019. A 
representative for Special Collections attended the quarterly Teen Department meeting to make Teen staff 
aware of the EBRPL Web Archive so that when they give their classroom talks at schools students will be 
more aware of the web archive. 

Special Collections also reached out the Eouisiana LGBT-t Archives Project organization to see if they 
would be interested in collaborating on a project to collect EGBT-i- web content for the state of Eouisiana. 
These talks have stalled due to time constraints but we hope to pick them back up in the near future. 

#BRWebstory: 

#BRWebstory was a campaign to get the community involved in collecting practices for the web archive. 
Every other month, the department would put out a call on social media with the #BRWebstory hashtag 
asking community members to contribute data that would be used to curate a specific collection on the 
web archive. Our first attempt involved local restaurants. Individuals would submit the name of a favorite 
local restaurant and the archivist would select websites to add to the collection. The response was decent 
the first time around. The department was able to collect over 20 names of restaurants to be added to the 
collection which resulted in over 100 seed UREs. After the initial attempt all others result in only a few 
submissions, sometimes none. We believe this is due to the fact that the archivist and the social media 
coordinator were reaching out to friends and colleagues on an individual level to gather the restaurant data 
but this was not feasible in the long-term and the campaign has been put to rest. 

Presentations: 

A representative for Special Collections spoke at the Eouisiana Archives and Manuscripts Conference 
(EAMA) on October 26, 2018 in Ruston, Eouisiana at Eouisiana Tech University. The panel, focused on 
the department’s experience implementing a local history web archive and the successes and failures of 
that implementation. 

Intellectual Control: 

Special Collections has put much of their focus on description of archived web materials and how to best 
present them to the patron from our ArchivesSpace instance. We’ve begun to implement some of the 
recommendations laid out by the Descriptive Metadata For Web Archiving: Recommendations of the 
OCLC Research Library Partnership Web Archiving Metadata Working Group and will continue to do 
more work like this over the next year. 
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Forbes Library 

Forbes Library has enthusiastically and deliberately approached the creation of a community focused web 
archive. The library was undergoing a Strategic Planning process at the time of the Community Webs 
grant application and built specific goals related to the training of staff in web archiving, identification of 
community partners for help in curating the collection, and building the collection itself into the Strategic 
Plan. We have trained all full-time Information Services staff in Archive-It software, and reached out to 
50 community partners (non-profits, fellow library professionals, web designers and other community 
organizers and leaders) for ideas on websites to preserve. Beginning in February 2018, we used a mix of 
one-on-one meetings and customized emails with a Website Capture google form attached, to solicit input 
directly from the community and our partner organizations. 

Many of the responses fit ideas and themes our own staff had identified as topics or sites for preservation, 
but we were particularly excited for sites or themes we hadn't identified in our own internal process. The 
library is developing a "landing page" for the web archive as part of a website revamp and hopes to try to 
use some creative means to help our patrons discover content within the collection. The work of curating 
this collection has reinforced and reminded staff of what was not saved or is not present in our physical 
collections. These gaps or silences often reflect subcultures in our community, stories that were not told 
on the pages of the local newspaper, or which might not be reflected in the websites of city government or 
local institutions. We have tried to remain conscious of what communities are being left out of the 
collections we are developing, and regularly seek new nominations from the public at public service desks 
and events to try to fill these gaps. By spreading the role of curation across the community, we can 
prevent staff burnout on the project and ensure that the perspectives represented in the archive are 
broader, more varied, and thus more reflective of our small city as it is. 

Collections created are organized under the categories: Arts, Music and Authors; History and 
Community; Politics, Government and Activism; Science, Agriculture and Environment and include 
websites and social media from local artists, musicians, activists, non-profits, small farms, politicians, 
local history organizations, historians, news weeklies and local government. The process of trying to 
create consistent and useful metadata for the web archive has inspired the library to look at its metadata 
practices as a whole, and we are looking to hire a consultant to establish a new metadata plan and ensure 
best and consistent practices for the entire archive. Inspired and informed by our fellow cohort members, 
we have written a web archiving policy which is being incorporated into an updated Special Collections 
and Archives policy for the library as a whole, and will be presented to the trustees for approval this 
summer. 

As part of the Community webs program, Dylan Gaffney has presented on the curation of the Forbes 
Library web archive at SAA 2018 in Washington DC, ALA-Midwinter 2019 in Seattle, and the 
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Community Webs 2018 Meeting in Columbus, OH. His presentation at ALA-Midwinter was recorded to 
become part of the Webjunction curriculum, and a blog post about Forbes Library’s web archiving efforts 
was shared locally and on the Archive-It and Internet Archive blogs. 


Grand Rapids Public Library 


Overview 

In the second year of the grant we deepened our understanding and practice of web archiving and digital 
preservation, and developed an advocacy narrative around web archiving. As an additional benefit, we 
were thrilled to find supportive and encouraging partners at Archive-It. 

Finding our stride in web archiving 

As we were finding our footing in the first year we wondered how we would ever run enough crawls to 
use our data budget. In the second year we found our stride with web archiving and swung in the other 
direction, finding more to collect than we had data for. We see a huge potential for web archiving in our 
local community, and have the skill set to move forward with a mature program. We are viewing web 
archiving as an important tool in preserving the increasingly digital record of the people of Grand Rapids. 

Forming an advocacy narrative 

Another significant area of development was in forming narratives around advocating for web archiving. 
Julie Tabberer spoke at the Symposium on the Future of Libraries at the American Library Association 
Midwinter Conference (January 2019), the Midwest Archives Conference (April 2019) and will speak at 
the Society of American Archivists Annual Conference (August 2019). 

The message that Julie focused on was why web archiving in public libraries matters. Preserving our 
history enables us to remember, critique and learn from our past. The record of our community is 
increasingly digital and online. Public libraries are key because we are solely focused on our individual, 
unique communities. Working on this grant project provided the context to develop this messaging, which 
is key to seeking support for web archiving and for public library archives. 

Providing specialized training 

Through the grant we are sending Colleen Alles, a part-time Library Assistant, to a week long class on 
digital material in special collections, through the Rare Book School. Colleen received a scholarship for 
the course fee but not for travel expenses. Having funds available for digital collections training made it 
possible for us to support her attending the course. We’re excited to have another staff member trained in 
digital collections work, supporting the ambitious goals of our archive! 

Support of Archive-It 

One of the best parts of the grant was meeting and talking with other public library archivists and with 
Archive-It leadership. There is something unique about doing archiving in a public library, and I deeply 
appreciate it that Archive-It staff recognize and value this. It is my hope that we can continue a 
relationship and work together in the future to advance the preservation of local and community history. 
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LA County Public Library 

During the second year of participation in the Community Webs cohort, LA County Library built a web 
archive that will serve as the foundation for a community-shaped web archive resource, built internal 
documentation and training materials on the mechanics of web archiving, and began to include web 
archiving as part of its regular digital collections development strategy. 

BUILDING A FOUNDATIONAL WEB ARCHIVE 

During Year 2, the Library built a foundational web archive that the community will be able to expand 
and shape in the coming years. It created a list of “candidate sites” identified systematically based on their 
potential to be useful long-term to various groups of users. The groups of sites identified were: 

• Sites relevant to any resident of LA County (e.g., sites administered by the county government) 

• Sites relevant to residents of smaller geographic areas within the county (e.g., sites linked from 
webpages of individual LA County Library locations; see the “local links” section under the 
“community” tab on this page! 

• Sites relevant to users of special collections, especially historical collections, within the Library 
(e.g., sites linked from webpages of LA County Library’s four Ethnic Resource Centers; see the 
“websites” tab on this pagel 

It then refined the initial list of sites by criteria such as site audience, site scope and contents, and 
potential vulnerability. The collection of 115 seeds that resulted from this selection process clustered in 
three areas: LA County government websites, cities in Los Angeles County, and Los Angeles County 
historical sites and museums. 

CREATING PROCEDURES AND TRAINING LIBRARY STAFF 

Building the initial web archive required developing internal documentation on how to do web archiving 
using Archive-It so that staff members other than the grant recipient would be able to work on the 
collections. Building on the Web Archiving Policy drafted in Year One, the Library developed a 
procedures document to guide staff through steps for harvest (including setting up test crawls and 
performing quality assurance), description and access (including creating metadata), and preservation and 
assessment (including seed reappraisal and policy review). The Digital Projects Coordinator trained one 
other staff member to review crawls, and training is slated for another staff member in the coming 
months. 

WEB ARCHIVING AS PART OF DIGITAL COLLECTIONS DEVELOPMENT 

The last major development to occur at LA County Library in Year Two of the grant was that web 
archiving became a legitimate concern to be addressed as part of regular collection development in the 
Digital Projects Unit. In planning the appraisal, processing, and digitization of an incoming archival 
collection (a large multi-format personal collection), the Digital Projects Coordinator was able to include 
the online components—both websites and social media accounts—as simply another format to be 
collected, processed, and made publicly available. This shift in approach would not have been possible 
without the Community Webs grant. 
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GOING FORWARD 

In the coming years, the Digital Projects Coordinator plans to continue to make the case for treating web 
archiving as a necessary collection development activity and web archives as an integral part of the 
Library’s digital collections. Remaining grant funds will be used to perform community outreach and 
education and to seek community input on the Library’s web archive. 


Marshall Lyon County Library 


Like many of our fellow cohorts, our experiences with community archiving has been a series of 
surprises, failures, and learning experiences. When we first applied to participate in the Community Webs 
project, we assumed that our area, which has a significant immigrant population and numerous arts and 
cultural opportunities, would have a comparable web presence. We discovered, however, that our 
community has been slow to promote and chronicle itself digitally. Our region’s digital footprint was 
considerably smaller and less developed than we’d expected. It was also largely comprised of Facebook 
Pages, many rarely updated and with no static webpage equivalent. 

Our collections began with two basic categories: the Marshall Community, which includes schools, local 
government, and community organizations; and Southwestern Minnesota Libraries, which includes 
libraries of the Plum Creek Library System. Following a catastrophic rain event in early July 2018, we 
also began attempting to collect websites and articles related to spontaneous events. So far, that has been 
limited to large-scale weather events. The July 2018 Rain Event collection contains government data, 
individual news articles, and other media coverage related to the intense precipitation and subsequent 
flooding. It was largely reactionary, which made it difficult to collect community reactions on social 
media. Our collection for the record-breaking winter of 2018-2019 was more intentional, and we were 
able to capture a few social media outlets for government agencies. In those cases, we chose to crawl 
Twitter rather than Facebook due to continued difficulties in successfully capturing Facebook. Because 
our community primarily uses Facebook, this meant that we were unable to capture as many comments 
from community members as we would have liked. 

For the Winter 2018-2019 collection, we also attempted to collect media and stories from community 
members using a Tumblr blog. By asking for submissions, we hoped to collect digital content that may 
have been posted to Facebook and would otherwise be impossible to locate and archive. We chose 
Tumblr because it is a free service, it has a built-in submission feature, and it allows us to organize 
submitted material and customize how it is displayed. Tumblr blogs also seem to archive well. Through 
requests on Facebook, in our library publications, and on our in-building displays, we asked community 
members to submit their photos, video, and text stories through Tumblr. This was not particularly 
successfiil. A few library patrons submitted photos, but they emailed them rather than use the Tumblr 
form. The only submissions received through Tumblr were from librarians. If we use this method in the 
future, we may only request email submissions. We will also advertise more widely. 

In March of 2019, we presented at the Library Technical Conference in St. Paul, Minnesota, with the 
Internet Archive and The New Brunswick Free Library of New Brunswick, New Jersey. The presentation 
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explained the Community Webs project, challenges and best practices, and ways other organizations 
could get started with web archiving. 

Although we intend to continue archiving local organizations and government pages, we plan to put most 
of our active efforts into archiving material related to spontaneous events or short-term projects and 
initiatives. Next year will mark Lyon County, MN’s sesquicentennial, which should provide an 
opportunity to collect both current and historical community content. We also intend to encourage 
community members and organizations to increase our digital footprint through programming, such as a 
bi-monthly Social Media Breakfast, and by providing access to technology with our combination 
digitization lab and digital makerspace, currently in development. This will create more opportunities for 
digital content to be archived, further community interaction, and increase visibility of information and 
cultural content to visitors and new arrivals. 

When considering a community web archiving project, we recommend that organizations evaluate the 
digital footprint of their community. We also, however, recommend starting crawls on known sites of 
interest as soon as possible, even by means of a less robust crawler, so as not to lose access to existing 
data. Whenever possible, test crawl first. Plan time to evaluate seeds and crawls, as well as consider new 
or different options. You may be able to bring in volunteers to check on established crawls periodically to 
ensure that seeds don’t need new scope rules. 


New Brunswick Free Public Library 


The New Brunswick Free Public Library greatly enjoyed being a part of the Community Webs Cohort. 
After attending the cohort meeting in Ohio, I felt reassured that the problems I was encountering when 
using Archive-It were not unique and I was much more comfortable expanding our collections to 
previously unused platforms (such as Instagram and map websites). Having the Help Center available to 
look up information in articles, or to submit tickets, was frequently a life-saver and I stressed that point in 
my presentations, but I found that I learned the most when I discussed web-archiving with other cohort 
members. Looking back on all of the crawls that have been captured, I feel very proud of myself for 
learning a new technology and now automatically think “I should save that to the Web Archive!” when I 
come across new information online. 

Our requests from the public for suggestions on what to crawl were unsuccessful. Therefore, I used our 
Social Services binder at Reference to try to think of any organizations, clubs, newspapers, etc. to save to 
our Archive. We have created 15 Collections with over 70 seeds combined. Most seeds have been 
crawled under 5 times, mostly because those are static pages that don’t update (The “About Us” pages on 
the city’s theatre websites are prime examples). Others have been updated with regularity either weekly, 
or because of data limits, monthly (i.e. the library website and the City website). A recent collection that 
was just added in early March, 2019, was a podcast titled “New Brunswick Speaks with Joyce Browning” 
from our local AM radio station. Four Librarians were invited to be guests on the program in late 
February, and that is how we found out about it. Since then, I have been adding the podcast weekly and 
providing full metadata on who the guests are. This collection is one of the best resources I have saved 
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because Ms. Browning has interviewed the Mayor and his assistant, the heads of different city 
departments, the heads of city non-profits and cultural institutions, and business owners. 

In Year One, I attempted to take every class online and in person on anything related to Web Archiving, 
since I felt I had much to learn. Towards the end of Year One, I already knew more about Web Archiving 
than the classes were teaching. I was left with a surplus of $1161 at the end of Year One, and had no more 
classes to take, so I decided I would have to attend more conferences outside of my local area. The 
opportunity came up to attend the LIBTECH Conference in Minneapolis, Minnesota, so I was a 
co-presenter there with Emilirose Rasmusson and Kyrie Whitsett from Archive-It. I found Emilirose’s 
portion of the presentation enlightening and learned how to add Metadata to Excel tables in order to bulk 
modify. After that, I made sure that every seed in my archive had at least basic level metadata. I looked to 
other library archive pages (such as East Baton Rouge and Queens) to figure out how to label some of the 
pages I saved (such as podcasts and social media sites) since I was unsure on some of them what Type, 
Format and Rights should contain. The following week, I attended the Computers In Eibraries Conference 
in Crystal City, Virginia, and co-presented with Agatha Monahan, Paul Kelly, and Karl Blumenthal. At 
that conference, Karl informed me that it was no longer required to add seed scoping rules for most pages 
(such as Wix and Facebook) because of new updates. That discovery was remarkable and helped me so 
much when I decided to add new Facebook pages to collections. I also attended the New Jersey Eibrary 
Association Conference in Atlantic City, NJ. I presented at this conference in Year One, so this year I 
only went as an attendee and to network and discuss the Web Archive. I brought along brochures that I 
updated from last year to include some of the websites we are saving, and I had three people approach me 
who remembered my presentation from last year and asked for more information. I attended a 
presentation titled “What do we do with This?: Managing Bom Digital Records” to see what they were 
presenting, and to add information about Web Archives. 

Now that the grant period is winding down, I have gone through all of my collections to make sure that 
the pages are still saving correctly, patching crawls for the first time (it took me this long to figure out 
how to accomplish that!), and finishing up metadata for everything. I will probably make one final push 
for new websites to save before the deadline in order to maximize the data I have used and in order to 
have those sites on record in the future. Unfortunately, although we have used Google Analytics to track 
the users we are reaching, I do not think there is enough demand for my director to allocate funds to 
continue web archiving with Archive-It. The hardest part of the process hasn’t been the actual crawling of 
pages, it has been publicizing the archive to people outside of the library. In a few years, we will most 
likely find a need to look back on the websites we have saved now to see what was going on in 
2018/2019, or what churches were here, or who were the department heads... .but right now, little has 
changed in the city and the few people who have found the web archive have not reached out to us to tell 
us why they were using it. 
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Patagonia Library 


Patagonia Library’s collection Patagonia, Arizona 2017 and Beyond is now home to nineteen Websites, 
one Facebook page and one video. One website is no longer available online; we captured a couple of 
updates before it disappeared. Unfortunately, I left out the crucial step of scheduling crawls for this site, 
the crawls were scheduled at the Nov 2 cohort meeting. 

Continued conversation in the community led to inclusion of a site that describes international mining 
interest in the Santa Rita Mountains northwest of Patagonia. An additional Facebook page developed and 
utilized by a new local nonprofit. Voices from the Border will be added shortly. Strong local support for 
the project still exists. 

The time to create metadata has arrived and I will now focus on this. 

Sadly, two attempts at proposals for the Association for Rural and Small Libraries, one for 2018 and one 
for 2019, were not accepted. The proposal to present at the Society of Southwest Archivists was also not 
accepted. I found it interesting that at the Southwest Archivist conference that there were no sessions 
about bom digital archiving and it was challenging to engage other attendees in conversation about the 
Community Webs project. 


Queens Library 

Web Archiving:This was our third year of web archiving at Queens Library. We met again this year with 
our internal curation stakeholders. Government Affairs, Communications and Archives to give them an 
update on the seed crawling we’re doing for them. Our Communications staff became more regular this 
year in their sharing of press clippings for us to crawl. Next year, we hope to entice them into actually 
USING the crawled sites when they need to look something up that was written about the library. We’re 
adding basic metadata to these captures in hopes of making them more searchable. 

Our two-person web archiving team of Technical Support Aide (Pablo Jarama) and Data Specialist (Kate 
Finley) continued their established workflow of setting up test crawls, reviewing and saving. We were 
about to exceed our data limit in April 2019 and the Archive-It team was kind enough to boost our 827 
GBto 1.5 TB. 

Public Outreach: Queens Library’s Head of Metadata Services (Natalie Milbrodt) reached out to a 
number of allied professional groups before getting a response of interest from our colleagues at Queens 
Museum. We discussed their digital preservation needs and agreed to crawl the exhibition pages on their 
website. During our conversation, we learned that they lost a group of event videos in a hard drive failure 
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a couple years ago and have been meaning to step up their A/V preservation protocols ever since then. 
Because we started a conversation about how we can help by web archiving, we can now continue to 
work out ways that QPL can assist them in their digital preservation. 

Professional Outreach:In August, Natalie co-presented on the Community Webs panel discussion at the 
Society of American Archivists meeting in Washington D.C. In March, Kate Finley attended the National 
Forum on Ethics and Archiving the Web at the New Museum in New York City. In April, Natalie 
co-presented in a session at the Museum Association of New York Conference in Cooperstown, New 
York. The session was about community archiving practices employed by the Queens Memory Project at 
QPL, which Natalie administers. Museum professionals were largely unfamiliar with web archiving and 
interested to learn how we folded it into the offerings we make to organizations in our community. In 
June, Natalie recorded a Web Junction webinar on QPL’s experience so far with web archiving. 


Facilitator Guide for MLIS Grad Student Workshop Sessions 
Developed by Kate Finley 
Workshop 1 

• Collection overview 

• Basic terms (will use Press Clippings collection to show examples) 

o Crawls 
o Seed 

o Seed components 

■ Documents 

■ Hosts 

• Successful Wayback Machine crawl example (backend only - Press Clippings) 

• Seed & crawl settings overview & examples / Setup of selected seed & crawl 

o Adding seeds 

■ URL entry 

■ Seed settings 

• Access level 

• Frequency 

• Seed types 
o Running crawls 

■ Crawl types 

■ Crawl limits (particularly time limits) 


Workshop 2 

• Review of Workshop 1 content 

• QL workflow 

o Review of workflow info covered last time 

■ Team members involved & our tasks 

■ Workload (currently, primarily Queens Library Press Clippings collection) 

• Weekly archiving requests 
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• Archiving request backlog 
o Workflow tracking options 

■ Excel data exported from crawl pages - will show example of crawl spreadsheet I 
work with to keep track of bad crawls, crawl re-runs, etc. 

■ Archive-It account seed and crawl Notes field 

• Viewing crawl results 

o How to view test crawl results 

■ 24-hr. turnaround time 

■ Where to find 

• Adding "-test" in address bar 

• Via crawl reports 

• Reviewing and troubleshooting 

o Basic reviewing/troubleshooting 

■ Comparison of archived seed and live counterpart to look for missing 
components (using test crawl set up last week as example - 
https://partner.archive-it.org/1068/collections/9653/crawl/554278 - was 

successful!) 

■ Reviewing crawl in terms of data budget 

• QL’s data budget particulars 

• Effect of our test crawl on our data budget 

■ Saving test crawl results (am waiting until the workshop to do this for new crawl 
w/ students) 

■ Reviewing crawl reports (using "world)ournal" seed example - robots.txt blocked 
a portion of the site) 

• Reviewing crawl report Hosts tab 

• Resolving (by adding collection-level Ignore robots.txt rule) 
o Advanced troubleshooting/Wavback OA (Marvl! 

• Common problems and solutions 

o Robots.txt exclusions 

■ crawl prevented by robots.txt exclusion (using "claim" qchron.com seed 
example) 

■ Solutions 

• Adding robots.txt exclusion at collection level 

• Checking for robots.txt before crawling (adding "/robots.txt" at end of 
address in address bar) 

o Missing images and video (example involves missing images - "green" DNAinfo seed) 

■ Checking image locations (and checking crawl report Hosts tab for out of scope 
content) 

■ Adding Expand scope rule to capture images, etc. located at subdomains 
o Multiple Wayback links 

■ Reasons for multiple links 

• Re-crawls (interaction between Umbra and Heritrix) (using "expect" 
qns.com seed as example) 
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• Capturing different versions of same URL (e.g., https and http - using 
Melinda Katz twitter feed as example) 

• Metadata/content findability 

o QL practice 

■ QL currently focusing on seed-level metadata for Press Clippings collection 
(Title, Source, and Date) 

■ Current workflow - adding seed by seed 

o Batch metadata work (will show Google Sheet of our exported metadata for Press 
Clippings collection) 
o Front-end search 

■ Metadata-related search 

• Seeing our added metadata as facets 

• Search example (''Jamaica") 

■ Full-text search overview 

• Continuation of search example ("Jamaica") 

• Full-text search basics (indexing time, advanced search, indexing of 
non-Roman languages) 

• Questions and account exploration 


San Diego Public Library 

Activities 

In the second year of the project I recruited three interns from local colleges and high school to assist with 
identify seeds for the many collections that we started. We created a web portal promoting our web 
archives that also includes an online form soliciting collecting topics from the public to better reflect the 
interest of the community. Our web portal: https://www.sandiego.gov/blog/web-archive 

Professional Development 

I attended the second cohort meeting in Columbus, Ohio. The cohort meeting was very helpful and 
information to hear about the different collections that other libraries are creating and collection. I was 
very inspired by the various collections that I also started some new collections at my library. In early 
June 20191 attended a weeklong course titled Community Archives and Digital Cultural Memory in 
Philadelphia. The course was very informative and inspiring to learn about different types of community 
archives that include web archives 

Planned Conference Attendance 

August 2-6, 2019 - Society of American Archivists Conference 
October 14-16, 2019 - Digital Library Federation Forum 

Collections 
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The San Diego Public Library is currently working on the following collections and we are continuing to 
look for new subject areas to collect that represent the diverse communities and interests in San Diego. 

The 1915-1916 Panama California Expositions: California’s Past, Present, Future at Balboa Park 
1915 was San Diego’s most notable year in the twentieth century. The Panama-California Exposition held 
in San Diego that year put the small town in the southwest corner of the United States on the map. In 
2016, the City of San Diego and Balboa Park celebrated the 100-year anniversary of the event that left a 
permanent mark in San Diego in its development of Balboa Park. 

LGBTQIA Collection 

San Diego has a vibrant LGBTQIA community. This collection includes LGBT organizations and 
resources in San Diego. 

Neighborhoods Collection 

San Diego has over eighty neighborhoods rich with history with many community organizations. This 
collection showcases the many local organizations in San Diego including business improvement district 
organizations to neighborhood historical associations to town councils. 


Schomburg Center for Research in Black Culture, The New York 
Public Library 

The Community Webs Grant has changed the trajectory of how we see collection building and 
development at the Schomburg Center. Curating the web as the next frontier for collecting and preserving 
content by and about people of the Black diaspora has inspired librarians and archivists regarding the 
possibilities of better understanding online communities and digital archives. The impact of this 
collaboration will be far reaching where the tide is turning regarding larger institutional buy-in from staff 
at other research libraries and within New York Public Library’s administration. However, even if that 
buy-in does not occur, I feel confident that my work at the Schomburg Center to build awareness and 
educate on web archiving was success where our staff and scholars understand the educative, historical, 
preservative power of building web archives. 

Collection Building 

The second year of the grant has been an exercise of building and scaffolding. Using the first year of the 
grant as my foundation where it was mainly about testing out Archive-It, its functionality and learning 
how to use the tool, this year’s focus has been to build multiple web archive collections, raise awareness 
about our new web archiving program through national presentations, and provide more educational 
outreach to my colleagues and community. Thus far the Schomburg has 15 collections in Archive-It that 
focus on Harlem, the Black diaspora, or they align with the collecting priorities of the 5 Divisions that 
make up the Schomburg Center: Art & Artifacts, General Research & Reference, Manuscripts, Archives, 
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& Rare Books, Moving Image & Reeorded Sound, and Photographs & Prints. Here is the list of eurrent 
eolleetions both publie and private: 

• Blaek Politieians National Eleetions 2018 - Present 

• #HashtagSyllabusMovement 

• Artists and Art Organizations 

• Digital Sehomburg 

• Sehomburg Livestream Arehive 

• Afriean Ameriean Professional and Edueational Organizations 

• Harlem Aetivism 

• Harlem Business Direetory 

• Harlem Blogs 

• The Green Book Web Resourees 

• Blaek Colleeting and Eiterary Initiatives 

• Hip Hop 

• Sehomburg Blogs 

• Blaek Ameriean Deseendants of Slaves 

• Deeember 12th Movement 

Project Activities 

I’ve had the opportunity to attend two eonferenees diseussing web arehiving, submit eonferenee proposals 
to edueate others about the grant and my projeet, and finally present on different oeeasions at NYPE and 
the Sehomburg Center on the grant and web arehiving. 

• Joint Conferenee of Eibrarians of Color in September 2018, the poster proposal “Hashtagged 
Syllabi: Web Arehiving Raee and Soeial Justiee Edueation.” 

• Presented alongside Jefferson Bailey at the International Internet Preservation Couneil in 
November 2018 in Wellington, NZ 

• Sehomburg Center All Staff Meeting 

• NYPE’s Researeh and Writing Interest Group 

• DPEA Fest 2019 eo-presenter with Jefferson Bailey and Melinda Shelton featured in an artiele in 
Ameriean Eibraries 

https://americanlihrariesmagazine.org/blogs/the-scoop/communitv-collaboration-dplafest-2019/ 

• NYPE’s Researeh Eibraries Conferenee - “The Web is an Arehive: Colleetion building through 
web arehiving eurrent events at the Sehomburg Center” 

• SAA eonferenee proposal aeeepted Community Webs and the Sehomburg Web Arehive 

• NYARC Advaneing Art Eibraries - Invited to partieipate in these meetings Feb 2019 and April 
2019. Have established a working relationship with Sumitra Dunean. The eurator of Art & 
Artifaets was able to partieipate in the February meeting too! 


Future Plans & Sustainability 
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Although Community Webs has concluded, the Schomburg Center is committed to building digital 
collections. We were recently awarded a Mellon grant to continue the work that Community Webs 
allowed us to begin. The #SchomburgSvllabus is a $226,000 2 year grant that builds on the 
#HashtagSyllabus collection with the intent to create digital collections focused on social justice as well 
as Black Studies. For this grant we are recruiting a digital archivist to carry out the grant objectives as 
well as build on the infrastructure that Community Webs allowed us to get in place. 

Creating collections that are timely and that also resonate with the Black community, I believe really 
demonstrate the power of web archiving. This is particularly true because the web is a place where Black 
people are driving content. The concept of “Black Twitter” is evidence of why this work aligns with our 
mission to preserve and document the Black Diaspora. Web archiving current events that speak to and 
inform our existing collections is a strategy for supplementing our historical collections. For example, the 
Schomburg Center holds the nation’s most comprehensive collection The Green Book travel guides but 
with the fdm’s adaptation of these travel guides, it became imperative to start web archiving content due 
to issues of search engine optimization where websites with content about the creators, Victor and Alma 
Green were being buried in search results for the fdm. By web archiving these sites and pages, the hope is 
to ensure against their web presence being lost and forgotten. This will be the model for how we connect 
web archiving to our collection holdings. Also there are plans to resurrect the Schomburg Clipping Files 
1925-199? Consistent clipping file work stopped in the mid to late 1990s but using web archiving, we can 
update the practice and expand on the subjects to include modern advancements in technology and also 
include the digital presence of a new generation of artists, entertainers, scholars, etc. 

A Note of Gratitude 

Thank you to IMLS and to the Internet Archive for creating an opportunity for public libraries to do this 
kind of work. This project has been a creative outlet that has challenged me and sparked my creativity 
regarding all the possibilities of community archives and collaborative collection development. As 
libraries continue to evolve, I am happy to have been a part of the Community Webs project which 
created a solution-oriented response to the changing landscape of digital collection development. The 
collections produced from this grant respond to not just our present reality but also anticipate our future 
users and our future selves. 


San Francisco Public Library 


Our second year of the grant started out slowly due to short staffing in our department. Fortunately, we 
have filled several positions and now currently have 3 people working on our web archiving projects 
including two librarians and a library assistant. During this second year we have created a collection 
development guide, a takedown policy, experimented with soliciting URLs in a few different ways, and 
created several new collections. 

In November 2018 I flew to Columbus, OH for the in person meeting. I learned a lot from the grant 
participants and appreciated the opportunity to meet in person. I also attended ALA Midwinter and 
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Annual, and DPLA Fest. I attended the Community Webs programs at Midwinter and at DPLA Fest, and 
picked up some new strategies and ideas from those presentations. 

The primary collections developed this year were created in conjunction with several affinity centers at 
SFPL including the LGBTQIA Center, the Filipino American Center, the San Francisco Flistory Center, 
the Chinese Cultural Center, and the Stegner Environmental Center. Our staffed provided an introductory 
training which covered the basic concepts behind web archiving and specifically this grant and then 
worked with the staff over two months to help develop and refine their web archive collections which 
complement the Center collections and represent the communities they serve. 

We experimented with another URL gathering approach by emailing the SFPL community at large and 
asked for nominations for people's favorite neighborhood blogs and websites. We received 19 
nominations, with a few duplications, and hope to continue to build as new nominations come to us. We 
are currently planning a third collecting project. Shuttered SF, to collect the websites local business and 
organizations before they close. We will be announcing that project at librarian programming meeting 
and then will have a link to the nomination form posted on our intranet. 

We have learned quite a lot this year! We’ve discovered inconsistencies in metadata within our Archive-It 
collections which we started in 2008, i.e. different date formats, different naming conventions, and 
different levels of descriptive metadata within collections. We are developing our own standard of 
minimal description informed by the Community Webs project using authorized LOC subject headings 
and name authorities. LOC standards match our current cataloging practices and enable the metadata to be 
repurposed in the future. We’ve also become aware that test crawls, while extremely useful, should be 
considered carefully when taking into account how much data will be captured. The test crawl may not 
completely capture all images and videos on data heavy sites and require patch crawls that will greatly 
increase the data amounts. This can be mitigated by increasing the amount of time crawling a site (1 day 
vs. 5 days). This is not an issue on sites that aren’t data heavy. We have decided to automatically use 
“ignore robots.txf ’ setting for efficiency. When this setting is not used, we almost always have to re-run a 
test crawl or do a patch crawl with “ignore robots.txf’ as a rule. We have put this in our collection 
development guide. We continue to have problems with social media sites, particularly with those with 
the “infinite scroll” or other dynamic content. We have decided not to capture them until we have more 
time to trouble shoot. It is our hope to work this out in the next year. 

With the knowledge gained creating these community collections, our staff plan to revisit our original 
collections of San Francisco government websites over the next year. We plan to expand the collections, 
do quality control for existing crawls, and refine the metadata to create better access. We will also 
continue to collect URLs from staff for the collections we created this year. 


The Urbana Free Library 


At The Urbana Free Library, the second year of the Community Webs project was spent further refining 
our collection development goals and strategies for the web archiving of Champaign County, Illinois. A 
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critical step toward developing these goals, as with any form of collection development at my repository 
is, how can we build relationships, and engage those who we want to document, as active participants in 
the documentation process. To assist my institution with moving forward with this step of the Community 
Webs project, I again took advantage of my role of archival educator at the University of Illinois at 
Urbana-Champaign, School of Information Sciences, and taught another eight-week course in the spring 
2019 semester on Community Archives. The course provided ten graduate students with an introduction 
to key concepts and issues associated with community archives, but with a particular focus on introducing 
them to web archiving, and the importance of preserving local online records in the age of the web. 

Throughout the semester, students played an active role in working with organizations that I had 
previously selected for inclusion in our library's Community Webs initiative, and that represent 
historically significant institutions and anchors in our communities and neighborhoods. Students were 
introduced to the joys and challenges of working with our local community's stakeholders by building 
relationships and working together in assessing their documentation. Students learned how the 
preservation of web-based content would complement and supplement our library's existing 
documentation activities, by preserving web-based materials created by the individuals and organizations 
we already actively document. But also how preserving web-content enhances our ability to record those 
hidden voices that are under-represented in our archives. 

Each student selected an organization for their community webs project at the start of the semester. Next, 
students completed an assessment of the existing archival documentation of the organization by 
researching the holdings of the Champaign County Historical Archives at The Urbana Free Library, and 
other area archives and museums. Next, students and I made contact with each organization and set up 
meetings to discuss their archival documentation, web presence, and the Community Webs initiative. 

After reviewing the existing archival documentation, web presence, and also insight from each of the 
organizations about how they use and value their online presence, students were provided with a Website 
Appraisal Checklist, to use as a tool to both determine the archival or permanent values of the web 
presence of the assigned organization, and to evaluate the quality of the organization's web crawls. 

Finally, students were provided with a Finding Aid Template and using archival descriptive standards 
drafted a finding aid for the organization's Community Webs collection. At their final presentation, 
students presented what they learned about the organization, an evaluation of its web presence, addressing 
relevant portions of the finding aid template and the web-appraisal checklist. 

I have attached a finding aid prepared by one of the students who were assigned to work with the local 
chapter of the League of Women Voters; I also included a journal entry he posted as a final reflection on 
the experience. 

In the near future, this finding aid and the others prepared by students for each organization will be hosted 
on our online catalog. Local History Online http://archivescatalog.urbanaffeelibrarv.org/polaris/ . I hope 
that the procedures developed over the past two years can serve as a guide to further develop The Urbana 
Free Library's Community Webs project. 
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Compiled by: [STUDENT] 

Inclusive Dates: Website Captures [2013 - oo], bulk [2015 - oo], 

Website Records [November 2012 - oo), 

Physical Collections [ 1922 - 2016] 

(110) Creator: League of Women Voters of Champaign County 

(245) Title: League of Women Voters of Champaign County Web Collection 

(300) Physical Description/Technical Access: 

This record is the website of the Champaign County Illinois Chapter of the League of 
Women Voters. It includes thirty-two crawls between May of 2013 and Lebruary of 2018. The 
website will be crawled monthly going forward. The website is accessible through the following web 
address: http://www.lwvchampaigncountv.org . The archived captures of the site are accessible through 
the Wayback Machine provided by the Internet Archive. To access the 
captures, visit the following web address and input the organizations web address into the 
Wayback Machine search bar: http://archive.org/web/ . The League of Women Voters of 
Champaign County's website is accessible without restrictions and archived captures of the site through 
Archive-It are also accessible without restrictions. 

(351) Organization & Arrangement; 

The website is arranged in a series of drop down tabs with various links. Prom the home page, the 
tabs are listed as 'HOME', ABOUT', 'OUR WORK', 'VOTING & ELECTIONS', 

*RESOURCES', 'OBSERVERS”, “STORE', and 'CART (0)'. The ‘ABOUT' tab offers 
information about the chapter's mission, what they do, their herstory, board member bios, 
bylaws, and how to contact them. Under 'OUR WORK' they offer links about opportunities to 
get involved, current issues addressed by the chapter, their calendar, blog, and some other 
documents. In ‘VOTING & ELECTIONS' there is a candidate forum as well as information on 
registering to vote and an elections calendar so people know when to vote. Under 
RESOURCES' there is a tab with some digital copies of their newsletter ‘The Voter', a 
government guide, other helpful links and one with information dedicated specifically to 
members. The other tabs do not offer any additional drop-down tabs. The site is available in 
French, Spanish, or English, directly accessible through pages on the site. 

(490) Series: Special Collections - 

'The Voter' monthly newsletter digital files, November 2012- 
Misc. Blog Archives January 2012 - February 2016 

(490) Series: External Web Resources - 
http://www.historv.com/topics/the-fight-for-womens-suflfage 

http://www.vote411.org/ 

http://www.elections.state.il.us/ 

https://www.champaigncountvclerk.com/elections/registration 
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http://www.elections.il.gov/vot.inginforniation/EarlvVotingT.ocations.aspx 

https://www.lwv.org/ 

http://www.lwvil.org/index.html 

http://www.lwvmclean.org/ 

https ://www.faeebook. eom/LW V CC1 / 

Also ineluded is a list of all the eities and towns in Champaign County along with their offieial 
website when available. 

(516) Type of Computer File or Data: 

There are a few pdfs available in the 'Doeuments' seetion under the 'OUR WORK' tab 
and in other plaees. Most of the other eontent on the website is aeeessible direetly through the 
site. There are also some YouTube videos embedded into the site from time to time. 

(541) Provenanee (Souree of Acquisition): 

Content on website is submitted and added by organization members. 

(545) Biographieal/Historieal Note: 

The League of Women Voters is a national organization with local chapters that formed in 1920 
by Carrie Chapman Catt as a suffragist advoeaey group. The Champaign County ehapter formed in 1922 
and remains aetive. They support soeial and non-partisan politieal advoeaey issues in an effort to ereate a 
more aetive loeal populaee and make a differenee in the eommunity and beyond. Their three main goals 
are to register voters, edueate voters, and aetive voters. 

(520) Seope and Content Note: 

This eolleetion is eomprised of reeords from the League of Women Voters of Champaign 
County's website. These records include media postings from the group, information about the group, 
links to websites related to the organization, blogs, newsletters, loeal government guides, voting 
registration assistanee, and membership information. The website is regularly updated. 

(544) Location of other arehival material: 

Physieal reeords are held in two different locations. The Illinois History and Lincoln 
Collection at the University of Illinois-Champaign/Urbana holds 19.8 eubie feet of their reeords. This 
eolleetion's inelusive dates are 1870-2009, with the bulk of the reeords from 1922-2009. These reeords 
inelude administrative records of various sorts, along with meeting minutes, serapbooks, and publieations 
from the organization. They reeeived aeeessions in 1972, 1983, 1997, 2000, 2007, and 2011 by members 
of the League and others elose to the League. The Champaign County Historieal Arehives holds the 
remainder of the available physieal reeords of the League. Along with a reeently proeessed eolleetion that 
is yet to have an available finding aid, the CCHA have multiple oral histories from League members 
available digitally, a photo file, an ephemera file, newsletter records, multiple publieations from the 
league, and the reeords of former member Jean Burkholder. 

(561) Provenanee (eustodial history): 
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Current physical records are donated to the Champaign County Historical Archives at the Urbana 
Free Public Library. 

(600) Subject-Corporate name: 

League of Women Voter's 

(600) Subject-Topical: 

Suffrage 
Voting Rights 
Voting 
Franchise 
W omen-Suffrage 

(600) Subject-Geographic: 

Illinois 

Illinois-History 

Illinois-Politics and Government 
(856) Electronic location/access: 

https://web.archive.org/web/*/https://www.lwvchampaigncountv.org/ 

https://www.lwvchampaigncountv.org/ 

[STUDENT] Journal #6 

I met with the Eeague of Women Voters last Friday. Anke and I attended one of their monthly 
board meetings held at the Champaign Public Eibrary on. The board is composed of six women. They 
were excited about the project and eager to learn about archiving, web-archiving, and why their records 
are important to archives. We briefly explained the genesis of the project, our goals, my contributions, 
and my thoughts on their current site and previous crawls. My search of previous crawls revealed that 
some of the video content was not saved. As a result of this, I suggested they add 1 -2 sentence annotations 
to each video along with a hyperlink to the YouTube page containing the video in case the embedded 
videos do not work in ensuing crawls. I also briefly discussed the locations of their current physical 
records and some tactics and ideas they may want to consider in relation to those archives. I suggested 
adding a section on the website information regarding the location of their physical archives (short 
paragraph or two), along with hyperlinks to the different finding aids for those collections or at least li nk s 
to the archives pages with information on how to locate different finding aids. It is unreasonably difficult 
to locate these archives, and I felt adding direct access through the EWV website may increase traffic to 
those collections, certainly awareness. 

One member was particularly interested in why the project was undertaken and how the EWv fit 
into the project. She was not clear on why the EWV website needed to be archived. Who was the 
audience? Who would look at these records? I explained that their website actively created history of their 
organization with every new piece of content added. The EWV was and is an immensely important 
organization nationally and locally for advancing progressive social justice reform. Their actions are 
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important for local historical record and arguably beyond. Researchers in a variety of fields my seek their 
records for projects related to Women's rights, social justice reform, Women's suffrage, local 
organizations, and much more. 

We discussed how often the site should be crawled and agreed that once a month should be 
sufficient. They hold monthly meetings and content often changes on the site as along with the meetings 
and different ideas/proposals presented during the meetings. I explained that as long as the site was 
regularly crawled, it would be in their best interest to add as much content as possible to the site. The 
more they add, the more would be saved for future generations of researchers and other interested parties. 

The LWV was very energetic and excited about the project. They were motivated to work on the site and 
make necessary changes to ensure it is well-archived going forward. The meeting was fun and interesting. 
It really showed me how important it is to meet with the different organizations being web archived. We 
both learned a lot from each other throughout our conversation and it was very instructive to both of us on 
how to proceed with greater success. It left me wary of the libraries/archives that chose not to meet with 
their groups. Will this sort of MPLP be most beneficial to both parties and well represent the group and 
their sites? We may only hope. 


West Hartford Public Library 

This year I have added 1 additional collection for a total of 5 collections. However, my archived data 
amount has more than tripled from 49. 7 G to 185.9 G. Adding seeds to existing collections and 
successfully crawling Facebook sites contributed to the substantial increase in archived data. I was 
unsuccessful crawling Instagram sites, but I hope to retry crawling Instagram sites during the next year. 

I attended the final cohort meeting in Columbus, Ohio in October and found it very helpful. Sharing 
obstacles as well as successes with other cohort members was heartening and I returned to CT with a 
renewed sense of purpose and motivation. 

For each collection, I have added an image and metadata. In addition, I have been adding metadata to 
individual seeds within collections for greater access. For one collection, I crawled multiple pages as their 
own seeds to take advantage of the Umbra assistive crawler for better results. 

I reconfigured the Library’s Local History landing page to highlight the Community Webs project and to 
help patrons easily access our page on Archive It. Also, this year the Library’s Materials Selection Policy 
was amended to include a section on Local History Materials. This new section included online material 
as a format to cover both our digital and web archiving information. In addition, online materials were 
incorporated into the Local History Collection Description document which describes the scope of items 
collected. 
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Grant money allowed me to attend the Digital Direetions Conference in Atlanta, GA. This annual 
conference sponsored by the Northeast Document Conversation Center is specifically geared toward 
professionals working with digital collections at archives, libraries, museums, historical organizations, 
government agencies, corporate archives, and other organizations that steward digital collections. 
Attendance helped me in curating the collection and the use of metadata. 

In March, I did a presentation on my experience with web archiving at the statewide Reference and 
Instruction Roundtable and was part of an Archive-It group that presented at the Computers in Library 
conference. On May 14, Dale Musselman recorded the presentation that I gave at the Computers In 
Library conference to be combined with other cohort presentations to become part of an online course for 
librarians new to web archiving. I will be presenting in October 2019 with two members from the cohort 
at the New England Libraries Association annual conference. 

Take-a-ways from this year include: Learning, again, the importance of web archiving, an idea that was 
reinforced when I realized that the information on a newly built school (2016) is now no longer accessible 
on the school’s web site. Secondly, even though I have progressed in my web archiving skills, I still 
consider myself a novice primarily because of the increasing complexity of websites and the speed at 
which they change without any notice. In addition, I have learned that the process of crawling and 
reviewing takes time, a precious commodity when we are short staffed. The staff at Archive-It was very 
helpful again this year. However, it is hard to shake that feeling of isolation when you run into difficulties 
knowing that it would be much easier to speak with someone than to go through the process of submitting 
a ticket and waiting for a response. 

Going forward I plan on adding a collection devoted to businesses in West Hartford and then non-profit 
and religious organizations. Also, we are trying to determine the best way to achieve community interest 
and involvement with the library’s developing web archiving collection. Finally, I will be attending the 
Archive New England Partners meeting in August. 
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Athens Public Library 

The Athens-Clarke County Library Heritage Room staff is just now finalizing metadata for our seven web 
archival collections. This journey has not been an easy one. There were many challenges along the way, 
mainly staff turn-over. However with a huge leap of faith in the past year we have put together a fantastic 
collection for researchers in the future. We have created the following collections: Business and Industry; 
Food Culture; Contemporary Visual Art, Music, and Literature; History and Community; Politics, 
Government, and Activism; Athens-Clarke County Library Heritage Room Collection; and Athens 
Regional Library System Collection. Ultimately we were able to archive 200 local websites and pages in 
less than a year. 

Our main challenge has been training and staff time. As a department with only 2.5 full-time employees 
who manage a genealogical/local history reading room and archive that is open and staffed 42 hours per 
week, we were left with little time to train and work on the Community Webs project. However, now that 
it is up and running, with clear collecting policies and procedures, maintaining and adding to the 
collection over time will be easier. 

We have yet to officially add web archiving to our collection policy. This is for a variety of reasons, 
mainly because we are not sure if maintaining the cost of a tool like Archive-It is within our ongoing 
budget. However, our current collection policy is broad enough that adding web archiving is not a stretch. 
Our collection scope and mission aligns with documenting our community’s history, food culture, 
politics, government, art, and social movements. 

There were a couple best practices we decided to move forward with: notifying site owners and asking 
permission to host the web archival files as well as taking time to completely, fully describe metadata for 
each of our seeds collected. 

Due to our challenges these past two years we have yet to engage in program activities related to the 
collection. As we finish metadata in the next few days we will be featuring a link on our library homepage 
as well as sharing the project with our community members who so helpfully contributed to the web 
archive. Our hope is to continue the project, adding spontaneous events and new community partners over 
time. 


Brooklyn Public Library 


Over the course of the Community Webs grant, I have created 6 public collections: Brooklynology Blog, 
District 15 Diversity Plan, Local Blogs, Local News, Local Politics, and Neighborhood and Community 
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Organizations. These represent the different strategies and approaches that I explored during the course of 
the grant. 

I initially focused quite heavily on local news sources on the web. I pursued this issue in part by hosting a 
panel discussion during Endangered Data Week, “Saving Local News on the Web,” and writing a follow 
up blog post for the Archive-It blog. 

I quickly realized, however, that our data budget would soon be overwhelmed if I were to continue 
attempting to comprehensively capture the many local Brooklyn news sources on the web. As a result, I 
turned my focus to smaller sites, which also tended to be in more imminent danger of disappearing. I 
experienced some “near-misses” during the relatively short timeframe of the project, where I captured a 
site only to realize it had disappeared from the live web soon thereafter. Unfortunately, I also experienced 
some actual misses, where I failed to capture sites in time. Nonetheless, I feel that the Local Blogs 
collection and especially the Neighborhood and Community Organizations collection provide a valuable 
record that would otherwise not have been captured. 

Capturing institutional web pages was always a goal but not something I prioritized until towards the end 
of the grant, when I made a concerted effort to capture the Brooklyn Collection’s blog, Brooklynology. 
Another late focus was an attempt at more rapid-response, issue-based collecting, inspired by other 
members of the Community Webs cohort. Unfortunately, due to some technical snags and time 
constraints, I only put together one collection with this model, on the NYC DOE’s District 15 Diversity 
Plan. 

Ultimately, I found it difficult to balance this project with my other duties. While we were told that web 
archiving was ultimately a kind of “set it and forget it” undertaking, where there is a heavy initial time 
investment but then only sporadic maintenance required, I did not find that to be the case for the type of 
collecting I ultimately found to be most valuable to my institution. I was constantly learning of new 
community groups and websites and trying to add them to our web archive. I also very often came up 
against technical issues that stymied me. Quality assurance and troubleshooting are time-consuming and 
the amount that was required was overwhelming, especially for someone who came into this grant with 
zero web archiving knowledge. That does mean, however, that I and my institution come away from this 
project with a far greater understanding of just how much it would take to adequately fulfill our collecting 
mission of “documenting the history of Brooklyn from pre-colonial times to the present” when it comes to 
web-based resources. I also wish I could have devoted more time to community outreach around this 
project, both internally and externally. 

As far as “best practices,” most of what I established as my procedure ended up stemming from 
limitations more than what would technically be “best.” Without a larger data budget, we had to stop 
comprehensively collecting news sites. Without a more technically skilled/trained person in charge of the 
project, it was difficult to capture more complex sites effectively. And further, I found it near-impossible 
to capture any social media feeds effectively, even with Brozzler. Prom what I have learned dipping my 
toe into professional web archiving discourse in the past two years, these problems are not uncommon 
even for institutions with full time web archivists (which are few and far between). Web archiving is still 
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a relatively new frontier, and even the most cutting-edge institutions find it challenging. For a small 
special collections department in a large urban public library, this project has been most eye-opening in 
demonstrating what we will probably never be able to do, or at least not any time soon. 

We are grateful to have had the opportunity to build the collections that we did and to capture at least 
some of Brooklyn’s web presence for these two years. We currently have no plans to continue web 
archiving past the timeline of the grant. 


Buffalo & Erie County Public Library 

The Community Webs grant was a wonderful learning experience and an awesome opportunity to capture 
all of the great things happening in Buffalo and Erie County. Buffalo has been undergoing quite the 
transformation over the past few years: large immigrant and refugee populations, construction, 
rehabilitation of historic properties, public art, and historically low unemployment to name a few. This 
grant was perfect for capturing all of the wonderful things happening here. I created collections around 
local government, infrastructure, immigrant/refugee population, festivals and fun, the Buffalo & Erie 
County Eibrary system, arts and culture, the Catholic Church sex abuse scandal. Community groups, and 
education. 

When I hit a slump thinking of what to include in the collection I sent emails to the other Eibrarians in my 
department. The technical name of our department is “Information Services and Outreach”. Since we’re a 
big team of outreach Eibrarians we travel all over the county seeing different places and meeting different 
people. My colleague were great resources for finding websites or groups to include in the collection. 

The cohort meetings were a great opportunity to meet up with other Eibrarians and learn what worked and 
what didn’t from them. I learned that many of us faced the same challenges. The main challenge being 
finding time to do this project correctly while still maintaining all of our other work responsibilities. I did 
not realize how much time metadata could take up until a week after a crawling a huge batch of websites 
and frantically trying to get it done. Thankfully I had another Eibrarian to occasionally help. Another 
challenge that many of us seemed to run into was convincing our Boards/Administrations to keep this 
project up. 

I am truly thankful for being given the opportunity to work on this grant. 
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Columbus Metropolitan Library 


Activities 

Since October 1, 2017 Columbus Metropolitan Library (CML) has archived 602 GB of data through the 
Internet Archive using the Archive-It web archiving application. CML currently has two collections in the 
Internet Archive: 

1. Columbus Websites; which is made up of government and philanthropic organizations in 
Columbus 

2. Immigrant Experience, which is a collection of websites devoted to providing resources and 
telling the story of recent immigrants to Columbus. 

Currently CML has 9 active seeds, crawling 9 websites monthly that have produced over 13 million 
documents that have been preserved, and made accessible through the Wayback Machine. 

Current seeds in Columbus Websites: 

http://columbuslandmarks.org/ 

Advocating, promoting, preserving Columbus Ohio's historic architecture and honoring excellence in 
urban design. 

https://columbusfoundation.org/ 

The Columbus Foundation is the trusted philanthropic advisor to 2500 individuals, families, and 
businesses who have created charitable funds. 

https://www.columbus.gov/ 

City information, departments, events, reports, and elected officials. 
http://thepublicbook.org/ 

The Public Book is a personal record of the Greater Columbus community in 1992. 

The Book was created to encourage grassroots involvement in the city’s commemoration 
of the 500th anniversary of Christopher Columbus’ encounter with the New World 
by inviting local residents to see their own lives as history. 

Its preservation now means that future generations can draw insight and inspiration 

from its pages—into what it was like to be alive in a mid-sized city in the heart of America 

during the closing years of the 20th century. The webpage was archived by CML at the request of one of 

the creators of the book and the website beginning in 1992. 

Current seeds in Immigrant Experience: 

http:// ohiohispaniccoalition. org/ 

The mission of the Ohio Hispanic Coalition (OHCO) is to improve the well-being and quality of life for 
all Hispanics/Latinos through advocacy, education, training, and access to quality services. 
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http://www.crisohio.org/ 

Community Refugee & Immigration Services (CRIS) is an independent non-profit organization that 
serves the growing refugee and immigrant populations in Central Ohio. 

http://www.somaliohio.org/ 

The Somali Community Association of Ohio (SCAO) is an officially recognized 501 (c)(3) non-profit 
organization founded by Somali community immigrants in 1996. SCAO was the first grass-roots Somali 
organization in Ohio, and the first to provide services to the Somali community. 

https://www.bnccohio.org/ 

The Bhutanese Nepali Community of Columbus (BNCC) is a community-based non-profit organization. 

It is a 501(c)(3). BNCC is dedicated to helping newly resettled Bhutanese refugee population from Nepal. 

https://u.osu.edu/dimelocolumbus/ 

jDimelo, Columbus! Is a collaboration between Ohio State’s Office of Diversity and Inclusion’s ,^Que 
Pasa OSU? and WOSU’s Columbus Neighborhoods produced by Leticia Rose Wiggins. Ohio’s Latino 
population is increasing - with Franklin County experiencing the most significant growth during the past 
decade. While the Census Bureau documents this expanding community using the blanket term 
“Hispanic,” the word itself accounts for a myriad of identities such as Chicano, Colombian, Cuban, 
Mexican, Mexican American, and Puerto Rican. 

Outreach 

This year Columbus Metropolitan Library hosted the Community Webs summit where I presented about 
our community archives projects including our digitization efforts and our website archiving that has been 
completed using the Archive-It software. In addition, CML and Cleveland Public Library presented at 
The Best Practices Exchange about our efforts using the Archive-It software for our Community Web 
archives projects. There was high interest from those attending the conference for trying the software in 
their institutions. 

2019 was also the official launch to our community archives project where we are actively having staff 
work in tandem with our library branches and community civic associations to actively collect community 
and neighborhood archives at a hyper local level. We believe this will lead to the need to archive more 
born digital items and neighborhood websites created by civic associations and local citizens. The 
Archive-It tool is an ideal piece of software to archive these often extensive and in depth neighborhood 
websites. 

Funds 

Funds for this reporting period were not used to travel, as CML was the host of the Community Webs 
summit, so there are no expense receipts for this year. Since we did not need to use travel funds for the 
summit we would like to put the $ 1000 in funds towards purchasing server space and services for the 
Archive-It software to archive more web content in the future. 
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DC Public Library 

To date, DC Public Library has created 14 collections as part of the Community Webs program. 
Highlights include: DCist Archives; Neighborhood Blogs of Washington, D.C.; Comics Publishers, 
Artists, and Authors of Washington, D.C.: D.C. Punk Web Archive; Advisory Neighborhood 
Commissions of Washington, D.C.; Ethiopian-American Culture in Washington, D.C.; and Funeral 
Homes of Washington D.C. In order to reach our goal of seed-level description for all collections, a pilot 
program to involve branch librarians in metadata creation was successfully enacted and expanded 
throughout the DC Public Library system. To complement this human-created data, DCPL is also 
harnessing the APIs of both Archive-It and ArchivesSpace in order to semi-autonomously populate both 
guides for our artificial web archive resources as well as details for web archive components in finding 
aids for general manuscript and organizational record collections. Other milestones reached include the 
approval of a collection development policy for web materials and the creation of a suggestion box 
mechanism for both website owners and the general public to expand the voices involved in the selection 
process. 

DCPL has learned that that public and institutional awareness of web archiving technologies, practices, 
and collections is low. We have committed to promoting creative uses of our materials in order to 
increase this awareness (for example, by participating in Archives Unleashed datathons, and potentially 
by hosting datathons ourselves). We plan to continue harnessing Archive-It’s API to automate description 
and increase access. We are also utilizing other web archiving tools to supplement our Archive-It 
collections and plan to continue doing so. 


Denver Public Library 

This year our automated, scheduled web crawls helped us produce a lot more archived content than during 
the first year. I got over the fear of making mistakes or creating imperfect crawl results. I experimented 
with comparing crawls using the Brozzler crawler, and with different scoping rules. 

I conducted discussions about web archives collection development with staff in my immediate 
department. Western History and Genealogy, and with my colleagues at the African American Research 
Library. We determined that there is a lack of understanding of the importance of collecting web archives 
amongst staff and library users, probably because there is a lack of use and knowledge of the Wayback 
machine among those same groups. 

I decided to make an effort to use web archives in the most practical way possible - for us this was using 
web archives to collect serials already in our collection, including online newsletters and publications. We 
realized we could link from MARC right to the Wayback to add to missing “holdings” in the library’s 
collection. If we did this on a large enough scale, it could solve some of our larger collection development 
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challenges that arise when serials go from print to digital. It could also be a good way to turn people on to 
the Wayback who don’t already use it. We need to begin helping users think of web archives as a research 
tool. 

The Community Webs workshop in Columbus in November 2018 was very helpful. The discussions and 
presentations at this event helped me to finally learn not to be hobbled by mixed results in my web crawls. 
Because running patch crawls (and honestly, having time to QC my crawls very holistically) takes time, 
and my job responsibilities are so varied, it was liberating to stop worrying so much about crawl results, 
and just start saving crawls even if they included some “bad seeds.” 

Lessons learned 

At my institution, our collecting areas are huge. Looking back now, I probably shouldn’t have mirrored 
these same large collecting areas in Archive-it. Settying up those enormous collecting areas in Archive-It 
didn’t provide me with any collecting guidance; that collecting “scope” was too broad for this project. I 
should have developed some collecting parameters that were more bite-sized and topical. Luckily I 
figured this out to the extent that in this past year I was able to focus our crawls on some popular subjects 
that are still well within our large institutional collecting scope: 

Neighborhood newspapers and newsletters, and Registered Neighborhood Organizations (RNOs) 

Political campaigns 
Community events and festivals 

These subjects are all ephemeral in their web content, unique to Denver and are likely to be sought after 
by researchers in the future. Had I set those parameters in the beginning, my collecting could have been 
less random and more pointed. The process of selecting seeds to crawl would have been less stressful and 
more guided. 

We added some web archiving language to our near-term collection development guidelines document, 
but I don’t think making the case for web archiving will be hard in my institution. If we have a practical 
application for it (harvesting content represented by a catalog record) then web archiving becomes a 
collection development tool. Also, if over time we can educate users about Wayback as a research 
resource, we will be taking our reference service to an ever higher level. 

Learning web archiving is a task for a team of people, with an overseer who has dedicated time, and 
ideally, a digital archivist role in their institution. The team should include a cataloger, and staff who 
work directly with end users. It is difficult for one person to be an expert in all roles that web archiving 
requires (collection development, cataloging, outreach, programming, reference to customers/users.) This 
work was fun to learn and to explore, but web archiving is a choppy process to oversee. You need to take 
time to choose sites to crawl, get to know the sites so you can compare crawl results with the “original.” 
There are real extended breaks in the process - time to crawl, time to wait for crawl to appear in 
Wayback, time to review results, time to troubleshoot and run patch crawls (I skimped on this part). 
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Setting up scheduled crawls helps with the time management factor, but a web archivist needs to keep 
reviewing sites and adding more seeds to grow and maintain the collection. 


Final takeaways 

Web archiving work deserves a champion! Thanks to this grant, at DPL, we will know and understand the 
importance and time it takes for web archiving to be an integral part of our work now and in the future. 
Dipping a toe into the web archiving world is a start! We wouldn’t have had this opportunity to begin 
without the Community Webs project and grant. 

Web archives are important for research. We need to teach our customers/students about how to use 
archived web resources and why to use them. 

We have learned the multi-faceted value that web archiving can have for libraries. It’s collaborative, its 
built for partnerships (with our collection donors and with branches across our own institutions.) We will 
advocate for this role as important to the future of a special collections within a public library. 


Henderson District Public Libraries 


For the second year of the grant, Henderson Libraries collaborated with the City of Henderson to develop 
a more robust web archive collection to capture the Henderson Community. We focused on Henderson 
businesses and organizations, community blogs from currently established, prominent and quickly 
developing neighborhoods. Local government included many twitter accounts of figures currently in 
office and those running of office. We expanded our casino, food, and drink collection to capture how 
great this area is in Henderson. The education collection was created, and added to by City of Henderson 
staff The Henderson Libraries web archive was continually updated with press from local news outlets on 
Henderson Libraries events and programs. The Southern Nevada Climbing collection was created with 
the help of the Southern Nevada Climbers Coalition. This was an important addition for year two, seeing 
as Southern Nevada is a worldwide climbing destination. The second year was really about rounding out 
the collections, and then focusing on maintain current collections. We also had a huge change in our 
community with the addition of the Raiders and Golden Knights sports team to Las Vegas and Henderson. 
Both teams have practice facilities in Henderson, and have chosen to collaborate with the City of 
Henderson and Henderson Libraries on cross promotion. We wanted to make sure that the lives and 
interests of individuals/communities otherwise not represented in our collections were properly captured 
with the help of our partnerships with the City of Henderson and SNCC. 


The seed nomination feature was added to the Henderson Libraries website recently. We’re planning on 
advertising this through local news outlets to allow the public to nominate websites they believe would fit 
within our collecting scope. 
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Our goals for collecting have remained the same since the first year. We will capture anything that helps 
capture the current lives and interests of the Henderson community with a large focus on education and 
local government, as well as recreation. 

We’ve learned to document everything that we have done in regards to web archiving. It is imperative to 
learning the process and finding solutions to any issues. Going back to any past experiences has been 
incredibly helpful, and will be helpful when creating a succession plan for someone taking over 
responsibilities. 


Kansas City Public Library 

The Kansas City Public Library has been an active and engaged participant in the Community Webs 
program as a member of the expanded cohort since Fall of 2017. 

Since launching the web archiving program we have captured 752GB of data from over 7 million 
documents. Since last year’s report we have expanded by creating two new collections: Nonprofit & 
Education and Local History. 

The past year has largely been spent fine-tuning our Archive-It presence and ramping up our crawling 
activity. The majority of our collections and seeds now have at least descriptions in their metadata 
records, with many of them containing subjects, creators and contributors as well. As the collection 
overall grew, we focused on this to provide a better sense of professionalism and completeness for patrons 
who visit the public Archive-It page for the Library. 

However, creating metadata for every seed in the collection did prove to be time-consuming and we were 
not able to accomplish it. Even basic, abbreviated records take a significant amount of staff time to create, 
given the scale of our efforts. We have considered adding this to the responsibilities of our technical 
services department but have not fully explored it. As it stands now, we will continue to periodically flesh 
out the metadata and revisit regularly so that we are continually making progress. 

A 2018 primary and 2019 mayoral election provided an opportunity for us to perform some targeted 
crawling related to the candidates and the issues. We crawled the primary candidates’ social media feeds 
and websites in an effort to preserve the robust discussion on local topics taking place online. 

A great idea from our last cohort meeting led us to crawl the obituaries posted by local funeral homes. 

The scope of this effort was daunting, but we were able to crawl 20 local funeral homes, achieving at least 
partial captures of the content. 

Last Fall, our repository of local history images went down for an extended time period due to a software 
migration. Having complete captures of the site on Archive-It allowed us to redirect users to the snapshot 
versions of various pages. We even implemented a search box to allow users to search the archived 
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version and pull up photos they wanted. While not a perfect solution, it’s a good example of the value of a 
robust web archiving program - you never know when it will come in handy. 

Last, I am happy to report that I have successfully obtained a budget to continue our Archive-It 
subscription when it expires. This is a permanent line item in my departmental budget that will permit us 
to continue to crawl and capture vital local web content! 


Lawrence Public Library 

Since last year’s report, we’ve expanded our collections from three to seven: Local Government, 

Lawrence Public Library, Arts and Culture, Education, Obituaries, Local History, and Community. 

• The holdings in the Local Government collection focus on the website of the City of Lawrence, 
Kansas, and in particular the agendas and minutes of the City Commission 2019 is a local election 
year, and so we are currently collecting websites and social media feeds for city commission and 
school board candidates. Our local police department has a very active and on point Twitter 
presence, so our Local Government collection also includes periodic captures of that feed. 

• The Lawrence Public Library collection captures the website of the Lawrence Public Library. 

• The Arts and Culture collection primarily focuses on a website with a vast archive of virtual tours 
of local historic homes; we have not yet made the Arts and Culture collection public, as we 
continue to struggle with technical difficulties with capturing the multimedia resources that make 
up the virtual tours. 

• Credit for the idea to create an Obituaries collection came out of discussion at the Community 
Webs meeting in Columbus. Technically, capturing obituaries has been somewhat challenging but 
we seem to have ironed out most of the kinks and are having good results with that initiative. 

• The Education collection focuses on capturing the websites of our public school district, our 
community’s private schools, and our technical school. The University of Kansas is 

• not included in this collection as they have their own Archive-It account. 

• Our Eocal History collection includes websites and social media feeds that focus on distinctive 
local figures or events of the past. 

• The Community collection includes the websites of neighborhood associations and local 
organizations, and social media feeds and multimedia artifacts that capture depictions of place, 
the activities of organizations, or topics of community interest. 

To promote awareness of web archiving in general and our participation in Community Webs in 
particular, this summer we have an intern whose work is devoted to soliciting suggestions for web 
artifacts that reflect our community. She is doing considerable outreach to community 
groups and through the library’s social media to build awareness of the project. 

In terms of lessons learned, it’s clear that web archiving requires at least one person who can 
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devote at least a few hours a week to the initiative. Working with Archive-It has a learning curve that 
requires dedicated focus, and the changing landscape of website architecture makes quality control a 
matter of ongoing vigilance—this is not a set-it-and-forget-it operation. We’ve dedicated one staff member 
(who came to us with previous Archive-It experience) to serve as the point person for the nuts-and-bolts 
problem-solving associated with this project. It’s also apparent that building buy-in even in a progressive 
cultural heritage organization is an uphill battle—I’ve seen a lot of evidence of a sort of cognitive 
dissonance, wherein individuals cite examples of web-based resources that need to be saved, and yet 
repeatedly express confusion about the purpose of web archiving as an initiative. This also means that 
considerable energy has to go into awareness raising and education for the general public to understand 
what web archiving is and why it should be done, in order to grow their participation. 


Pollard Memorial Library 

Due to some complications around the government shutdown, the archival group that we were using to 
bounce ideas off of failed to meet for nearly seven months. We were using them to help us to identify 
websites that we would need to preserve and guarantee to existence in the future. Because of this delay, 
staff members started to identify and compile listings of sites to capture, which we are in the process of 
doing right now. 

We are focusing on two key areas of Lowell’s community and history with the first collection being 
Lowell History, including various sites of specialized archival groups and local organizations and blogs 
committed to history. We realize that many of local historians are getting older and the younger 
generations do not have the mental knowledge (yet) of some of the unique parts of history. Some of this 
information has been captured through their local websites and some of the organizations are facing 
challenges related to sustainability, as they don’t have a younger group to take up the mantle. As such, we 
are beginning to capture these pages to ensure that this information is available in the future for those 
looking for information and research about Lowell History. 

The second collection is focused on Lowell culture. The City has been a haven for immigrants, refugees, 
and various ethnic groups throughout its existence. Starting with mill girls through the Irish, Greeks, 
Southeast Asians, and to those of today from Middle Eastern and African countries, Lowell has 
established a community space for these groups and many of them honor their heritage through various 
celebrations around the city. It is important that we capture the groups celebrating, those who support 
them, and also the activities that they use to demonstrate their heritage. 

We have 2 additional collections related to the City of Lowell and our library. We are looking at other 
departments to highlight in these areas, but most of the information for these sites resides on the currently 
City Hall and Library websites. 

Much consideration has been given around history and culture in regard to social media. We are just 
starting to identify those social media platforms that will demonstrate culture in a positive manner. An 
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issue that we are having regarding social media and historical information (photos, flyers, postcards, etc.) 
is that the Internet makes it too easy for people to break copyright, steal images, remove watermarks for 
protected images, and misrepresent items as their own. There was an incidence of one site we were 
looking at that downloaded images from the Library of Congress and the local historical society and then 
was adding their watermark to the images and posting them online. We do not want to preserve those sites 
that are demonstrating blatant disrespect for ownership and since we want to ensure that the legacy of the 
items online is duplicable or can be made available in the future for research, personal understanding, and 
the like. We are having a difficult time finding reputable social media outlets that correctly identify their 
items and give credit to the original sources. The comments are interesting and highlight aspects of 
history that are important or through crowdsourcing, are able to identify sites, people, etc. This personal 
feedback is helpful in filling in the blanks historically and why we were interested in capturing these sites. 
The lack of consideration for copyright and, at times, outright stealing of items to boost the popularity of 
a social media site is tough to reward. We will likely begin to capturing the social media sites associated 
with local historical and cultural organizations, but need to be aware of those sites that are less reputable. 
This research, unfortunately, takes time and the scrutiny needed to ensure a valuable resource is not 
always easy. 

We will continue to work on capturing the historical and cultural sites and news of the city of Lowell. 
We’d love to continue capturing this work in the future to maintain down the road. We have limited 
budgets related to this type of archival collecting, but it would be wonderful to see if we could continue 
this in the future, especially in regard to capturing what’s happening in the city. 


Sonoma County Library 

Sonoma County Library joined the Community Webs program when disaster struck in the form of the 
October 2017 wildfires, prompting the History & Genealogy staff to seek a way to capture a sliver of the 
stories, videos, photos, discussion, organizing and healing that lived online in the aftermath of the fires. 
Maria Praetzellis of the Internet Archive, herself a Sonoma County resident, created the opportunity for 
the library to build a web archive of the fires - 

North Bay Fires 2017 - under the Community Webs program, where the library could benefit from the 
training, funds and cohort to meet the critical need of our community. 

The focus of the North Bay Fires collection was content at risk of disappearing from the web such as 
blogs, the websites of neighborhood organizations and newly-formed groups, and the social media 
presence of such groups. Capturing the content of local government websites and news was also a priority 
in order to understand how agencies and infrastructure functioned during the disaster as well as their role 
in the rebuilding process. The main county newspaper. The Press Democrat, won a 2018 Pulitzer Prize in 
the Breaking News category for its coverage of the fires. Being that the newspaper does not archive its 
online edition, the North Bay Fires web archive has become a unique repository for this invaluable 
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content. The geographic scope of the web archive was Sonoma County; the most active period of 
collection was from November 2017 through the October 2018 anniversary. 

Sonoma County Library staff participated in fire remembrance events as well as presentations on History 
& Genealogy Library resources where we explained and demonstrated the web archive. Once such event 
was an open house, where alongside other special collections, staff showed visitors the North Bay Fires 
web archive on a laptop and answered questions about the scope and content of the collection. In most of 
these encounters, we found that members of the public were largely unfamiliar with the concept of web 
archiving and needed a personal demonstration of how to access content to begin to understand the 
collection. Once they reached an understanding, they often had ideas for content to contribute to the 
archive. 

Throughout this project, we learned the challenges of archiving online news, namely it is dynamic, hefty 
in data volume and requires careful oversight. We also learned that adding basic metadata when creating a 
new test crawl is a best practice that would save time researching and re-familiarizing oneself later down 
the line. 

Looking forward. History & Genealogy Library staff have requested funds from the system-wide library 
budget for web archiving aimed at under-documented communities in Sonoma County, particularly 
Native American, farmworker, youth and women activist communities. We also hope to form web 
archiving partnerships with the City of Santa Rosa, whose recent open government initiatives dovetail 
with web preservation, as well as with The Press Democrat. We will continue to closely follow the 
research on metadata harvesting and collaborative collecting—two elements we believe are fundamental 
to sustainable web archiving. 


Westborough Public Library 

Westborough, MA is a town of around 18,000 people positioned between Boston and Worcester. We 
were primarily a farming community that then turned to industrialization and then morphed into 
becoming a corporate park community—and all of these phases in our existence have a physical presence 
in our town. We still have a few working farms, although most of the industrial buildings have 
disappeared due to fire. We still have a sizeable number of families living in town who trace their 
ancestors back to the original founding families, but we also have a fast-growing Asian population that 
makes up almost a quarter of our population, mostly from South Asia. The town recently installed a 
public cricket pitch where the Westborough Cricket Club plays, and a Sikh Temple recently opened up. 
The diversity and relative wealth of our town have created a robust setting for lots of restaurants. We have 
active community clubs—Rotary, Civic Club, Garden Club, etc.—and we still function under the 
Massachusetts Town Meeting form of government. We enjoy our small-town living, and the communal 
enthusiasm for everything the town represents is palpable. Westborough loves its history, which makes 
my job as the Local History Librarian a lot easier. We have many times had so many people showing up 
to history programs that we have had to turn some of them away at the doors. 
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My biggest challenge as the Local History Librarian has been finding a way to include our large South 
Asian population in what I do and what I collect. As a part of the effort to overcome this challenge, I have 
been collecting websites in Archive-It, and what strikes me now that my overall collection is beginning to 
take shape is how much the collection of websites come together to capture the spirit and nature of our 
town—something that would have been impossible to do if I were confined solely to collecting print. 

With Archive-It, I can collect both little league and cricket websites, both the Congregational Church and 
the Sikh Temple websites—and the fact that they sit next to each other in the Archive-It collection gets to 
the heart of what Westborough is all about and what it aspires to be. To be sure, the various cultures that 
make up our community continue to exist for the most part in our separate silos, but there is a clear 
feeling on all sides that we don't want to live that way. We just haven't figured out how to cross these 
bridges yet (to mix metaphors). But our library is determined to be a part of this process, and Archive-It is 
beginning to shape up as an important tool in helping to bring about this change. 

Here are a few of the collections that I have created: 

Westborough Celebrations and Events - This collection includes website and Facebook activity 
relating to our town's 300th anniversary celebration in 2017.1 only have a small collection of 
physical documents and objects that I was able to pull together throughout that year, so this 
Archive-It collection captures and preserves much of the digital data that most assuredly would 
have disappeared had I not grabbed it. We have other physical collections relating to past 
Westborough celebrations, so this collection will allow us to continue to build on this theme. 

Westborough Sports -1 created this category to capture the founding of the newly created cricket 
team, because it serves as a sign of our changing demographics in town, and it now sits next to 
ten other websites focused on sports in Westborough. 

Westborough Town Government - Our physical collections include town records going back to 
the eighteenth century, so capturing the digital manifestation of our town government as 
presented to people today is important. 

Westborough News -1 started with capturing obituaries in newspapers, since I get questions from 
people seeking them a lot. From there, I have started collecting the newspapers themselves. This 
category presents my biggest challenge, because the data from my crawls eat up a huge 
proportion of my allotted storage budget. My crawls regularly are cut off because they reach the 
data limit I had to put in place. But because our local newspapers are no longer available in 
microfilm—and have not been since the 1970’s—I believe it is important to keep collecting them. 
I continue to experiment with finding the right settings, but success on this front has been limited. 

Other collections I have created include: Westborough Religious Institutions, Westborough Social Media, 
Westborough Arts and Culture, Westborough Foodways, Westborough Community and Civic Clubs, and 
Westborough Obituaries. 
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When I met with our library director and IT person to plan our collection development strategy for 
Archive-It, we decided that our current collection development policy could serve as a guide simply by 
adding "websites" to the formats that we collect. But I also began to wonder if our traditional approach to 
collecting in general is too "old school." My room looks pretty "19th century," and while I want to be 
mindful of building continuity between past and present, I have to ask: are we at a critical moment where 
a paradigm shift is in order? At the time this question occurred to me, I had read an op-ed article by 
Thomas Friedman in the New York Times where he put forward the idea that we are only in the second 
inning of the Digital Revolution, and news about Facebook and its business model was just beginning to 
dominate the airways. Is the local history collecting strategy that was developed during a time of rapid 
changes coming out of the Industrial Revolution still appropriate for our time? Do we need to reconceive 
what a Local History Program is, and rethink what, why, and how we collect? The answer is obvious. 

Yes! 

In fact, since that time, I have developed a whole training session for the Digital Commonwealth that is 
designed for people working in local history cultural institutions. As a part of this training, I use 
Maslow’s Hierarchy of Needs as a strategic model for outlining how our programs can go from what I 
call a Nineteenth-Century Paradigm, which currently dominates the practice of local history today, to 
creating a Twenty-First-Century Paradigm. These training sessions have been so successful that I am now 
putting together a book proposal based on them, and Archive-It is a key component in helping me to 
imagine how local history practices need to be transformed for now and for the future. 

Even though the Community Webs program urged us to work with our community to create an advisory 
board for collecting websites, our committee did not see the value of doing so at the time. Our town 
seemed too small for such an entity, mainly because our community came out of our 300th anniversary 
celebration exhausted. Assuming I could assemble such a committee, it most likely would have been the 
same overworked volunteers who already have a voice in how we do local history. Instead, I created an 
online suggestion page where people can submit suggestions through a link next to the Archive-It one. 

But to date, I have yet to receive a single submission. We still somehow want to involve our community 
in deciding what we want our Local History Program to be and what it should collect. Can we create 
programs where we can solicit this feedback, but in the process make it interesting and beneficial to the 
people who participate? 

Ultimately, I think the answer is going to reside in my intention to create a Westborough History Working 
Group. I am first going to start with asking such a group to use its collective knowledge to apply better 
dating to the photographs in our collections, especially the ones that do not have any dates attached to 
them (which is the vast majority). Once I have some active members in this group, I can then expand the 
list of activities in which they engage. In the end, I think that this group will end up serving an advisory 
capacity to our Archive-It resource, in addition to helping us rethink what we do in other ways. It is quite 
possible that we were not yet at the stage in developing my local history program even to think about 
putting together an advisory group back when the committee first met, but now I have a clearer sense of 
the kind of participation I will need from my community to create a local history program for today. 
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In the midst of all of my thinking about how to create a Twenty-First-Century Paradigm for local history, 
the Archive-It part of my overall program suddenly seems small. We are no longer simply coming up 
with a collecting strategy for websites, but reconceiving why we collect, how we collect, and what we 
collect. Archive-It is only a part of this larger strategy to transform what we do. Then again, without 
having Archive-It as a component or collecting tool, I would not have been able to start asking the kinds 
of questions that need to be asked about what our collections are, how we go about putting them together, 
and what we do with them. The value of Archive-It may not be seen for years to come. That’s okay. I’m 
patient. But like any collection, the more robust it becomes, the more useful it becomes. I have no doubt 
that the Local History Librarian who succeeds me a hundred years from now will appreciate the fact that I 
put these collections together, so that people then can gain better insight into who we are now. 
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Community Webs Year 2 Survey Responses 

Name 

Jacquelyn Oshman 
Anthony Vaver 
David LaCrone 
Abbie Zeltzer 
Dana Bullinger 
Diana Bowers-Smith 
Shanley Olszowy 
Joanna Kolosov 
Natalie Milbrodt 
Agatha Monahan 
Makiba Foster 
Hoan-Vu Do 
Emilirose Rasmusson 
Bridget Cooley 
Julie Jabberer 
Melinda 

Abby Hoverstock 
Mel Gooch 

Melissa Fisher Isaacs 
Anke Voss 
Ashley Shull 
Chatham Ewing 
Aaron O'Donovan 

Institution 

New Brunswick Free Public Library 

Westborough Public Library 

Kansas City Public Library 

Patagonia Public Library 

Henderson District Public Libraries 

Brooklyn Collection at Brooklyn Public Library 

Buffalo & Erie County Public Library 
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Sonoma County Library 

Queens Library 

West Hartford Public Library 

Schomburg Center for Research in Black Culture 

San Diego Public Library 

Marshall-Lyon County Library 

Pollard Memorial Library 

Grand Rapids Public Library 

Shelton 

Denver Public Library 
San Francisco Public Library 
Lawrence Public Library 
The Urbana Free Library 
Athens-Clarke County Library 
Cleveland Public 
Columbus Metropolitan Library 

Please describe your library's approach to collection development and 
policy relating to web archiving 

- Primarily crawling websites of local non-profit organizations, city government, and cultural 
institutions. If we are asked to remove a site, which we have not, we will gladly remove it. 

-I decided that the collection development policy that our library has in place for archival 
materials sufficiently covered the collecting of websites, so I did not make any changes to it. 
Since our town is relatively small—and because I am an active resident in town-we decided that 
to create a focus or steering group would be overly burdensome. We have way too many similar 
groups in town, and most likely such a group would be filled with people I already know. Instead, 
we decided that I would make a concerted effort to be even more active in the community and to 
keep a constant lookout for collecting possibilities, as I would normally with any other archival 
collection. I did create a "Suggest a web or social media site" page and publicized it, but I have 
yet to receive any responses. 

-We have a pretty broad collection development policy. It directs us to collect sites created by 
local persons with a focus on a specific neighborhood or community of practice. Special 
consideration is given to at-risk sites and those that document underrepresented groups. 
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-Web archiving was introduced to the community and iocai organizations at in person and group 
meetings.Consensus was reached on the focus and content of our deveioping coiiection, it faiis 
within our Mission as a guardian of iocai history. Archiving digitaiiy born materiais was added to 
our existing Coiiection Deveiopment Poiicy, discussed with staff and the Library Advisory Board, 
and then approved by the Town Councii. 

-Our Digitai Project Librarian works with community partners such as the City of Henderson and 
the Henderson Historicai Society to identify websites for archiving. Our seiection process is 
based on the foiiowing criteria: reievance and vaiue of subject matter to our community, sites 
that in some sense at-risk (inciuding but not iimited to spontaneous events such as disasters 
and trending sociai topics), and compiementarity of websites with existing print and digitai 
coiiections heid at Henderson Libraries. Websites affiiiated with Henderson Libraries, and those 
of organizations whose print archives are heid at Henderson Libraries are high priorities for 
archiving. We aiiow the pubiic to nominate websites they beiieve are a vitai aspect to capture for 
the Henderson Community. These sites are assessed by the Digitai Projects Librarian to make 
sure they fit the coiiecting scope before they are captured. 

-Our web archive coiiecting poiicy started as the same as our anaiog coiiecting poiicy, which 
essentiaiiy boiis down to we wiii coiiect anything Brookiyn-reiated. However, we have adjusted 
our approach through this program and are aiming at more specific goais in our web archive for 
year 2. 

-Free range, i'm ieft to my own devices with a supervisor occasionaiiy checking in. 

-SCL joined CommWebs at a iater date than the originai cohort in response to the wiidfires of 
October 2017. We started a web archive coiiection in November 2017 without deveioping a 
poiicy or pian. However, as we began to coiiect URLs, we reaiized the need to set a scope for 
the coiiection, nameiy geographic and genre/format/type (websites, Facebook, Twitter), and a 
focus on coiiecting iocaiiy produced content. We chose not to activeiy coiiect NextDoor and 
Facebook content that required joining a private group. 

-We have invited our Gov Affairs, Communications and Archives Departments to curate our 
crawis. We have aiso inciuded websites of iocai community partners such as wikitongues and 
iocai musicians. 
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-For Web Archiving, our iibrary has foiiowed the same guideiines as for other materiais that we 
coiiect for the Locai History coiiection. We have crawied sites reiating to the town government, 
schoois, significant buiiding and transportation projects as weii as foiiowing business activities. 
However, i aiso expanded to crawi a site, the i84 project, which wiii impact West Hartford 
residents, though the specific area of highway is in the adjacent town of Hartford, i pian to 
reduce crawiing that site because even though it wiii impact West Hartford residents who use it, 
it is not necessariiy unique to West Hartford and that what i feei i shouid concentrate on in this 
second year. Aiso, this particuiar project wiii most iikeiy be covered by the State of CT. 

-Our coiiection deveiopment poiicy regarding web archiving is an effort to aiign the web archive 
with the physicai content of the Schomburg Coiiection. The divisions of the Schomburg Center 
is subject and format driven so our web archive wiii reflect community reiated content that aiigns 
with the scope of the 5 research divisions. For exampie, the Art and Artifacts Division coiiects art 
and artifacts by and about peopies of African heritage throughout the worid. We are working to 
to document the web presence of community art coiiectives and up and coming artist who have 
a web presence. We have been in discussion with NYARC regarding possibie cross 
coiiaboration. 

-We are aiready coiiecting materiais based on San Diego so it was simpie to broaden what we 
coiiect to aiso inciude websites. 

-Currentiy, we have no formai coiiection deveiopment poiicy for digitai or born-digitai items. We 
are aiso in the process of rewriting our very oid coiiection poiicy, wherein digitai and web 
archiving wiii be discussed. The specifics of that inciusion wiii depend on what we are abie to do 
after the grant period has ended. 

Currentiy, we have been oniy focusing on the iibrary and city's pages with some iimited iocai 
history. We'ii iikeiy be expanding this by the end of the year to capture iocai organizations, 
churches, and non-profit festivais and ceiebrations. 

-Our initiai focus for web archiving has been 1) news media, especiaiiy oniine-oniy pubiications, 
2) our own websites and sociai media and 3) requests from archivai donors. We are aiso adding 
iocai biogs and ephemerai content as we come across them, mostiy sites that provide a 
non-dominant opinion and couid disappear at any time, in the next phase, we’re iooking at 
crawiing sites reiated to our existing physicai coiiections. We aiso want to coiiect oniine 
obituaries, many of which are not pubiished in the newspaper at aii. 
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-We have not yet written a formal collection development or policy guidelines for web archiving, 
and need to do that. 

-The Birmingham Public Library has a collection management policy that reflects the influences 
of the community served, provides a basis for acquiring material and information, defines the 
library’s mission and goals for the collection, sets coherent boundaries for the collection, 
provides for systematic management of the library’s collection, sets a systemwide standard for 
the organization of materials, serves as a training tool for library staff, and promotes the sharing 
of materials between locations. In addition to guiding the acquisition and disposition of library 
materials, this policy also incorporates a preservation initiative for the library’s historic and 
valuable special collections. 

The Birmingham Public Library strives to develop good collections to meet the needs of online 
users, to offer information equitably across the system, and to provide information in the most 
useful presentation. The value of online information includes such attributes as currency, 
greater search capabilities, systemwide access, convenience, comprehensiveness, and visual 
appeal. 

While online resources provide more current information, easier access, and better delivery, 
they come with a high price and require thought. 

The library's online resources are in electronic formats and accessible only by computer. These 
resources can be developed in house, stored locally, or purchased by subscription. Virtual 
resources include: 

• Online subscription databases 

• Databases and indexes produced by the library 

• Subject-resource pages 

• Downloadable content such as books or music 

• Digitized versions of library materials 

• Social software 

• Websites - General considerations for selecting include subject content, 
comprehensiveness and accuracy of information, ease of use, and currency. Preference is 
given to sites produced by government agencies, educational institutions, and other 
noncommercial sources. Selectors also strive to include a variety of sites for the purpose of 
providing a range of viewpoints on both current and historical issues 

• Ebooks and ejournals 

Selection Guidelines: As the largest and one of the oldest public libraries in Alabama, 
Birmingham Public Library holds extensive materials of local and regional importance. Selecting 
material to digitize or capture requires careful thought, knowledge of the collection and the 
community, and organizational skills. In general the following are taken into consideration in the 
selection process: 

• Intellectual Property Rights 

• Value 

• Usefulness 


5 




Community Webs Y2 Survey Responses 


• Technical Considerations 

Maintenance Guidelines 

Because technology is constantly changing, technical provisions must be made for archiving 
and maintaining digital resources. Technical considerations for maintaining digital resources 
include the following: 

o The equipment for production of and access to the product is in place, 
o The plan for software upgrades is current and kept up to date, 
o The product will survive platform migrations. 

o Continuing education in technology is provided for the production staff, 
o Current best practices are in use. 

o Backups are in place and operate on a fixed schedule. 

o Policy and procedures for correcting and updating technical processes are maintained. 

-In my public library, changing up the language of our collection development policy is no small 
task. All policy changes must be approved by the Library Commission, so this goes above the 
heads of even our City Librarian. As a result, changes to policy are not taken lightly. The good 
side of this equation is that the collection development policy is necessarily non-specific, so as 
not to be limiting. Our departmental staff focuses on collecting areas without getting into details 
of media formats or file types that we do or do not accept. For us as a work culture, naming web 
archiving as a collecting activity or area of focus would be out of place in a statement that 
instead errs on the side of being non-limiting, inclusive and non-specific. In our department, we 
avoid changes to policy by making internal changes to our procedures, which do not undergo 
outside scrutiny. Here is our collection scope statement, revised in 2018. With a growing 
collection of web content in 2019, I will propose that “archived web sites” be added to the format 
list at the top of the paragraph in our scope statement. While web archiving is not yet named, it 
is currently addressed, if non-specifically, in the final sentences of the statement: 

-The Western History collection includes manuscripts, photographs, books, serials, maps, civic 
studies and research, audiovisual media and art. Materials support research west of the 
Mississippi River, including Alaska and Hawaii. Building upon an unparalleled collection for 
study of the Rocky Mountain West, today’s selectors seek to preserve Denver’s diverse 
metropolitan history since the Great Depression. Selectors also add related scholarship from 
history and other disciplines to sustain and inspire research. The collection aspires to be the 
City and County of Denver’s official publication and research repository. Materials take physical 
or digital forms to preserve and afford access. Challenges of media form or format are 
secondary to historical import. 

-We are a little behind in finalizing our collection development guidelines due to staff leaving the 
department and a new manager taking over management of SF government info. For San 
Francisco government sites we are going to work to fill gaps in our collection, as well as create 
parallel collections documenting the websites of entities that are required to provide documents 
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to us under the San Francisco Administrative Code. For community websites, we are going to 
try to complement areas which have dedicated subject experts on staff. 

-For policies, for non-government sites we are still considering what efforts we may take to alert 
people in advance that we will be crawling their sites. It’s been really helpful to look at the other 
institution’s policies shared through the course. 

-Our policy related to our local history collection-which stipulates that our collecting should 
center on resources related to Lawrence and Douglas County, and serve as a supplement to 
the more robust collections of the two local history repositories in our community-has continued 
to serve as our framework for web archiving. 

-Working with our community partners, and making them an integral partner in the process. 
Using existing collection development policies, with the intention to make edits in our regular 
January 2019 review, we are web archiving local content. 

-We have a written policy guiding web development as it relates to other collection priorities. 

The policy for web archiving is the same as our local history collection scope. That is; put simply 
the collecting of materials of organizations and individuals important to the growth of our 
community. 


Was your library involved in conrimunity outreach activities/public 
progrannnning? If so, please describe the impact this had on your web 
archiving program. 


-We created brochures to hand out at New Jersey Library Associations Annual Conference and 
at a local history fair. It is unknown how this impacted traffic to the site. 

-We have had a LOT of change in our local history program, and Archive-lt has been swept up 
as an important part of this change. We re-branded our local history program from the 
"Westborough Room" (just dreadful!) to "The Westborough Center for History and Culture." We 
have changed our focus on mostly preserving the past to helping people become more involved 
in documenting and sharing our town's history and culture. The "culture" aspect of the program 
allows us to celebrate Westborough, with an eye towards capturing our present for future 
generations, so that they can understand what our town is all about at this point in time. 


7 



Community Webs Y2 Survey Responses 


-While we developed an outreach/engagement plan, we have not yet implemented it due to 
competing organizational priorities. Our low-key and opportunistic promotions have resulted in 
approximately 12 sites being included in the archive that we were previously unaware of. 

-Yes, our library was and is involved in community outreach activities. In a community the size 
of Patagonia, one-on-one or small group discussions work well. In addition to our local 
newspapers highlighting our web archiving program, an update will be included in our Friends of 
the Library newsletter. 

Our collection is still in its infancy and prior to any public programming, the collection will be 
enlarged and have added metadata. At that time we will host a collection viewing at the library 
and supply participating organizations a link to the collection. 

-In October 2018, we launched our web archiving landing page in our new Local History 
Collection webpage. Since then, we have been actively promoting the web archiving program at 
our programming events and outreach activities. It has created more awareness of the program, 
and has increased the public's interest in participating in nominating websites. 

-Yes, a few presentations and a panel discussion. They were useful experiences for thinking 
through and shaping our approach. 


-no 


-We did not carry out public programming around the building of our web archive. 

-We did one two-day course for library grad students and haven’t gotten our two other trainings 
together yet for allied cultural heritage workers around the borough and a middle school class... 
but we will! 

-Yes! The library sponsors many programs and we also do outreach to schools and other 
community organizations. The library was part of the Memories of Migration IMLS grant from 
2014-2016 and I captured the blog for 2016. The library continued this program after the grant 
ended and I also was able to crawl the blog for 2017. This past summer even though the Teen 
department continued the program the blog was not updated. However, interviews were 
conducted that are in the process of being saved in an archival quality format. I have asked the 
Teen Library who is the leader of the project, to let me know when they are done so I can crawl 
those files provided they are not too big. 
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-As a Research Center we are heavily involved with community outreach. Some of the 
community programming we do that is streamed live to international community is now a part of 
a Schomburg institutional Archive-lt collection Schomburg Livestream Archive to preserve those 
talks published on the web. Also some of our public programming has inspired ideas for new 
collections in Archive-lt. 

-For the first year of the grant I was working with colleagues within my organizations to create 
web collections. For the second years I'm planning on doing more community outreach activities 
such as focus group and collecting survey from the public. I think public input is very important 
with our web archive program because we want to know what the public is interested in. 

-We approached a few groups and received a lot of blank looks. It wasn’t merely a case of not 
seeing the importance of archiving web content but also lack of web content in general. This is 
something we’ve noticed as we’re attempting to put more of our content and events on various 
sites; few organizations or groups advertise digitally or have a consistent digital presence. 
However, fewer events are advertised in traditional media than were previously. To try to 
capture more of those events, we have begun scanning posters for local events that are hung 
up at our library and are working out a good way to archive them. We also specifically archive 
community flyers in an attempt to capture some of those event listings that wouldn’t make the 
paper. Also, even after several months of emailing back and forth, we are still having trouble 
getting our local county website archived without resorting to ignoring robots.txt. 

-We have been working with a local archives group to identify organizations to capture related to 
culture and local history. They will assist in helping to suggest sites to document. 

-Not yet 

-Our Archives Department has been involved with many of the same organizations we are 
interested in archiving their websites. Through meetings with these groups, we’ve received 
permission to archive their websites. 

-Our department is regularly involved in outreach to specific student groups. We offer class 
tours and visits to groups of all ages and experience levels. We also find ourselves supporting 
community events outside the library by providing research support. As the one staff person 
focusing on web archives, I have made small strides in connecting community need to my role 
archiving the web. Examples include: working with customers and colleagues to locate serials 
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that can be found online. I have subsequently crawled local organization’s sites that I learned 
have serials (newsletters usually) on their websites and are websites with a small, local reach 
and scope. I am working with our catalog librarian to determine ways to create consistent 
metadata to point to these resources online from our catalog. Another example is adding seeds 
to crawl based upon the library’s partnership with festivals or traveling exhibits, such as a 
Redesign the Redline exhibit (about housing discrimination against people of color) coming up 
in November. A couple of examples that haven’t worked so well are when I taught a class (open 
to the public) about archival resources at our library and mentioned that libraries everywhere are 
increasingly archiving web sites, and I showed them the Wayback machine. No one in the room 
knew about using this resource for research. Some couldn’t think of a time this resource would 
have come in handy for them (yet.) When I asked them if they knew of web sites that were 
important in their daily lives and research projects that we could consider adding to our list of 
sites to crawl, I got a lot of blank stares. Similarly I put together a blurb about web archiving for 
our newsletter which is sent out to our 1000 person subscriber email list. I asked for suggestions 
about websites people couldn’t live without. Could they email me their ideas and comments? I 
didn’t get a single response! Again, I think the concept may have just seemed foreign and I may 
not have honed my approach to my audience yet, but I think that most public library researchers 
don’t even yet think of the wayback web as a thing they could use in their research - many 
people just don’t know it is there. 

-We have not hosted any public programs centers around community archiving. In early 2019 
we will be working with our affinity center managers(Affinity Centers include our African 
American Center, Filipino American Center, LGBTQIA Center, Environmental Center), to work 
with them to conduct outreach to their communities to solicit suggestions of sites to archive, and 
then we will create a collection for each affinity center. 

I believe this will help us spread the word internally to staff about the work we are doing with 
web archiving, and give us an opportunity to provide some training to staff about the importance 
of web archiving. It will also help give a broader reach into the communities in SF since the 
affinity center managers are in close contact with the communities they serve. 

-Yes. This summer we hosted an intern whose project ("You Are Local History") was to reach 
out to the community to digitize objects that told stories about the experience of living in 
Lawrence, Kansas. As part of her project, she also solicited input from community members 
about websites that reflect local experiences. 

-not yet 

-We are just beginning this process on a grass roots level by visiting and talking with local 
community organizations and leaders. 
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-Yes. It helped. 


-Yes, we worked with community organizations in the past, this process made us think about 
underrepresented communities and gaps in our records to create a more complete picture of 
our community. 


Have you (or do you plan to) provide access integration for your WA 
collections with other library systems? For example, incorporating WA 
data into your library catalog. 


-no 


-I don't know what this means, so I guess the answer is no. 

-While we will make the archive publicly accessible, we do not have any plans to involve other 
libraries in the management or activity of the archive. 

-To be answered when I receive a definition of access intergration. 

-Our goals in the future include including the WA data in our library catalog. We want to start 
this process in early 2019 to provide better access to the public. 

-We will soon add a link to our web archive to our "Collections" page, where we link to our 
finding aids and our digital resources. We will not be incorporating web archive material into our 
catalog at this time. 

-not at this time 

-We are not ready to talk integration at this point because the library will be transitioning to a 
new ILS in the near future. In the meantime, we hope to emulate DCPL's Washingtoniana Web 
Archive landing page that links to Archivelt. 

-I think a collection level MARC record would be a great idea for us. 
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-I have not incorporated the WA collection into our catalog yet. I will need to collaborate with the 
IT manager to determine how to provide access to the library’s WA collection from the library’s 
domain from the Local History page of the web site. I have approached her regarding providing 
access, but we have not accomplished it yet. 

-Our conversation with NYARC might be produce some integration. I hope that we can link to 
our library catalog but as of now the intent has been to simply build collections. 

-Yes, I do plan to incorporate our WA into our digital archive (CONTENTdm). 

-Once we’ve had a chance to make our current web archive collections more accessible, we’ll 
link them from our website. However, since we’re a part of a federated system, we have only 
limited access to our library catalog. Historically, there have been very few non-circulating items 
listed in the catalog. The only digital items currently listed are those which can be checked out 
through Overdrive or OneClickDigital, both services which are paid for jointly by all libraries in 
the system. 

-We plan to promote the site on both the library's website and potentially the catalog down the 
road. 

-Yes, we plan to create a record in our catalog for each web archive collection. 

-Birmingham Public Library plans to add MARC records into our cooperative library’s catalog 
that will link to our collections in Archive-IT. We are also contemplating adding links to 
Archive-IT from our CONTENTdm digital collections database. 

-I’m not sure how we would provide access integration with other library systems. Other than 
participating in DPLA and ILL services, I don’t know of other means that we integrate with other 
libraries. As far as our catalog is concerned, I am beginning to work with our cataloger so we 
can determine a workflow linking our catalog to serials for local resources that we’ve crawled. 
These catalog record links could eventually extend to our records in Worldcat. I need more 
information about various ways this can be done. 

-This is something we will work on when we have the new staff on board. It is unlikely that 
things will be in our catalog due to chronic short staffing in our cataloging dept. 


12 



Community Webs Y2 Survey Responses 


-We plan to cross-promote our web archive with our local history portal (Digital Douglas County 
History), and may also link to the web archive through our catalog. 

-not yet 

-We will feature as an embedded link on our website. 

-Yes. We link out from our ContentDM system, and those records are exported into WorldCat, 
and will, ultimately, be downloaded into our OPAC. 

-Eventually we would like to get our WA collections into CONTENTDM, and perhaps at a later 
date have the WA put into our catalog. 


What aspects of your web archiving progrann have proven successful 
thus far? 

-Having a place to store electronic news articles related to current events. 

-I did a lot of planning and testing by creating lots of seeds, but not necessarily saving them, so 
that I could get a feel for how it all worked. I was initially trepidatious in saving any of the data I 
collected, but I finally decided to take the plunge and started collecting lots of websites and 
social media in a variety of categories (town government, restaurants, religious organizations, 
etc.). The more categories I filled out with web and social media sites, the more I got the sense 
that what I was collecting was a more accurate representation of life in Westborough today than 
any other physical collection that I could identify and bring in. It makes sense: so much of our 
lives are now carried out online. The impact of what I am collecting now may not be felt today, 
but I believe that it will years from now, when someone wants to research the early twenty-first 
century. 

-Staff engagement has been strong and people do see the value of web archiving (which we 
were worried about). We have been able to compile a very comprehensive and thorough 
collection of sites without being limited by our data budget. 

-Excitement still exists in the community. One commercial company's website that we archived 
to tell our story has now been sold to a larger company. Archiving this site will offer a story in 
continuum. Running crawls of most websites have went well. 
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-Collaborating with community organizations has proven to be way more successful than initially 
thought. Once those involved understand what the program is and what we're trying to do, they 
get excited about participating and helping to create collections and figure out websites they 
think are important to archive. 

-Saving small community organization websites. 

-asking other people in my department to give me ideas on what to archive 

-documenting our local government websites and the sites of groups that were formed in 
response to the fires 

-It’s integrated as a regular part of our practices in Metadata Services. It hasn’t made a broader 
impact yet in our institution or for our researchers. 

-Crawling the town, library and schools websites has proved very successful. I plan to add the 
library’s Facebook and Instagram sites. Also, proven to be successful is the local online 
newspaper, We-Ha. What I really like about the crawls in how comprehensive the information is 
from these crawls. 

-Overall interest in social justice related web archiving has garnered a lot of interest. Capitalizing 
on current events might be a strategy to bring more people and users to web archiving. I also 
think the education component has been successful. I've enjoyed educating people about web 
archives and particularly how this might be a practice that is necessary for not only students but 
anyone. Talking about Archive-lt and Web Recorder shows the possibilities of web archiving at 
the institutional level and the personal level. 

-I think the platform is really easy to use, the problem is deciding what to collect. 

-We have managed to archive some content. 

-Interest in web archiving has grown and people can see the value of capturing sites in the 
moment, especially as print materials are limited. 

-Getting started at all is the thing I am most excited about! Over the last year I’ve learned how to 
do web archiving, I’ve successfully created and managed several collections, I’ve run into 
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problems and solved (most of) them. I’ve also started thinking about web archiving at a higher 
level, thinking about the theory and how to approach it (how to organize collections, how will 
patrons want to access things, should we get some form of copyright signed, ethical 
implications, etc). 

-I often think about how much more we need to do, but it’s also nice to reflect on what we have 
done already and how much further we are today than a year ago. 

-We have partnered with BPL’s Archives Department to ensure that we are collecting not only 
the physical documents and materials from local organizations but also collecting the websites 
and online documents from the organizations as well. 

-I’ve gotten more adept at crawling sites. I have generated interest in archiving websites from 
my colleagues, though frankly, not a lot of collaboration with these colleagues yet. I’ve learned 
that it’s better in some cases to capture the portion of the site I want than to spend a lot of time 
troubleshooting ways to get a complete capture of an entire webpage. I have found that 
sometimes I don’t have the time or skill to figure out why a capture didn’t work. I’ve either saved 
imperfect captures, or I’ve narrowed the URL to just what I want to capture so it can be a good 
capture. Not sure if this is a true success or more of a work-around. 

-One of our staff spent a good amount of time archiving the SF government pages after Mayor 
Lee's death. This helped us learn the various aspects of web archiving, including trying out 
social media, with a little bit of time pressure since we weren't sure whether the pages would be 
taken down or replaced. I think this also helped people outside of our dept, understand the 
importance of web archiving, and understand a little better what we've been up to. 

-Community members who have provided input love the opportunity to be involved in shaping 
collections. Having the opportunity to preserve resources that offer an unprecedented window 
into community life during this time, and to be more inclusive in our collecting. 

-Building relationships with our community partners. 

-The actual crawls and collecting is easy, everything else takes time. 

-Complementing existing digital and physical archives with web archives. 
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-The ability to archive lesser known organizations that represent underserved citizens and 
underrepresented groups. 

What aspects of your program have been the most challenging or 
required re-thinking along the way? 

-deciding on a crawl schedule for most sites. Daily was too much data so we had to change to 
weekly for some sites, and monthly for the ones that hardly changed. 

-I struggled with the worry that I was not engaging my community enough in making decisions 
about what to collect. In addition to my reservation about creating a focus group that would 
consist of the same people who always raise their hand to volunteer in our town, I knew that 
such a group would radically slow down my ability to collect. But when I came to the conclusion 
that few people in town think about collecting as deeply as I do-l am an expert in it after all-and 
that I interact with lots of different groups on a regular basis and monitor closely what is new in 
town, then I was able to go gang-busters and create what I believe works towards representing 
what our town is all about. I will continue to push people to take a look at what I am collecting 
and help me make decisions. I see nothing wrong with allowing an expert to take control of a 
situation, as long as that person does so seriously and responsibly. 

-It has been challenging to find the sweet spot for crawling sites that balances completeness 
with data size. In the beginning I repeatedly test-crawled sites in order to lower the size by 
rather modest amounts. As I realized that our data budget was sufficiently generous i realized 
we could accommodate larger crawls. In general reviewing test crawls has been a larger time 
and attention suck than I anticipated and I have been able to reduce that burden somewhat. 

-Robots.txt and social media archiving. Learning technical aspects through webinars rather than 
a hands-on approach. Larger institutions have access to a greater number of people with 
technical knowledge. 

-Getting the public interested in web archiving is somewhat challenging. Once you effectively 
explain the program, most people are interested and excited in the program. Developing a short 
and effective elevator pitch for the program is the key. We've had to re-work the wording for 
brochures and speaking about web archiving in various outreach activities many times. 
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-Two things: 1. not having the data budget to save the iocai news websites we were hoping to 
save, and 2. struggiing to create meaningfui connections/coiiaborations with the organizations 
whose websites we are crawiing. 

-What to archive, how often to run scans. 

-Capture of sociai media content has been chaiienging, particuiariy pubiic Facebook groups and 
Twitter hashtags. There were a iot of mapping projects that we were unabie to capture with 
Archive it. We tried using webrecorder on these maps, which worked weii. But we were unabie 
to get the reiated WARC fiies to render in Wayback. Another chaiienge was with audio fiies of 
radio broadcasts. The audio fiies were captured by Archiveit but didn't aiways piay back in 
Wayback. We had to rethink our attempts to crawi the oniine edition of our iocai newspaper. 
There was too much muitimedia content making it heavy on data. Our iater strategy was to 
crawi the "fire coverage" section of the website. 

-Getting the crawis to work can be chaiienging. And getting aiready busy pubiic service staff 
invoived is very chaiienging. 

-There have been severai chaiienges. Navigating the crawiing technoiogy, determining why 
things go wrong and adding metadata. However, the addition of the heip box in the bottom 
corner of the screen has been a tremendous heip. From my perspective, it wouid be great if i 
had had an opportunity to speak directiy with someone for 15 minutes maybe 2 or 3 times 
during the year. Often times, the answer to one question ieads to another question which 
invoives iooking in the heip section, writing the emaii, waiting for the repiy.... when it wouid be 
terrific to get the answer in context and buiid upon it. 

Aiso, it aiways seemed to take ionger than 24 hours to view the resuits of a crawi under test 
crawis. 

-The most chaiienging has been being a team of one. Aiso as i began to recruit within the iarger 
organization of NYPL and making considerabie headway, NYPL's one and oniy digitai archivist 
ieft. 

-i think the most chaiienging so far is getting input from the pubiic on what we shouid coiiect. For 
me the chaiienge is aiso getting the pubiic engage on the topic. 

-When we began this project, we had expected that there wouid be considerabiy more 
born-digitai content that was specific to our community, in many ways, that was a compiete 
mismatch of expectations: those of us who were newer to our community were surprised by the 
iack of digitai content, and those of us who grew up here just assumed it was normai that even a 
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regional hub in a rural area would have so little digital content. As such, we’re trying to think of 
ways to encourage the creation of more digital content. 

We were hoping to add to our digital footprint with our planned memoir project, but we’re still 
looking for grant funding for the equipment necessary. We also have a comparatively large 
immigrant and first-generation American population, including Somali, Karen, Spanish, and 
Hmong. We were looking forward to archiving content from their communities, but we 
discovered a definite lack of digital representation in those communities, as well. 

The hurry-up-and-wait nature of web archiving has also been challenging to schedule around in 
terms of workflow. 

-Selection of sites to capture has been the most difficult because we want to include all, but 
identifying the various sites to consider (webpage vs. social media or both) has made the list 
larger than anticipated. 

-Having the staff time and knowledge to get it all done! It is vital that we expand our special 
collections into digital spaces, but it is essentially adding entirely new responsibilities and tasks 
to our already full load. It’s difficult to find capacity to work on and provide training for web 
archiving and born-digital archiving. 

-The most challenging has been working with Archive-IT software. It has been frustrating trying 
to find the best coding formulas to save most of a website without the pitfalls and failures. 

-Skill at assessing test crawls and making improvements has been challenging for me. When I 
have gained skill (and some efficiency), it has seemed like growing the project to reach the 
community and create partnerships has seemed impossible for me in some ways - sort of like 
going from 0 mph to 100 mph too quickly. To be more successful in outreach to my community I 
need to find a way to bring more people into my project to contribute. In my department this 
year, we’ve had some unforeseen staffing setbacks (vacancies not filled) and a service 
expansion request to hire of a new position (digital archivist) fizzle out. However, I think that my 
participation in the Community Webs project might help us build our case next year when we try 
once again to create a digital archivist position in our department. 

-The most challenging thing for us has been staffing and knowledge retention. Losing the staff 
who have been assigned web archiving has been a chronic problem for us over the past 10 
years. With digitization becoming a priority for the library, we have expanded web archiving job 
duties from 1 person to 3, with the hope that having multiple people trained will help us continue 
web archiving through staffing transitions. 

Lack of organization in past crawling has created a disorganized set of collections, which ideally 
we would have grouped together into one collection. We hope in the future we'll be able to 
migrate multiple seeds into one collection. 
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Another challenge is beginning to work with staff cross-departmentally. It is something that has 
not been done often in an ongoing manner. I think this will be critical for us in order to develop 
more community involvement. 

I think also a lack of knowledge about archiving fundamentals has also been a challenge. It's 
hard for some of us to wrap our heads around doing this work that might be useful to someone 
sometime. In public libraries, our focus is usually how to get the most out of any effort we make. 

-Some of the challenges are technical: many of the resources that we've identified for collection 
have multimedia or social media components that have proven tricky to capture. Some of the 
challenges are related to capacity: finding the time to research resources to capture, to 
coordinate community outreach, to familiarize yourself with the quirks of working with Archive-lt. 
Also, the Archive-lt interface isn't totally intuitive, and the time lag between launching a test 
crawl, being able to find out how it went, and relaunching a crawl with tweaks is an obstacle 
when you are shoehorning the process into your regular responsibilities. And some of the 
challenges are related to the lack of mainstream understanding of the value and ephemerality of 
web resources. 

-The time and resources it takes to build those relationships with our community partners. 
-Collection development and permissions policies. 

-Documenting websites that belong to individuals rather than organizational sites. There can be 
issues with technology and rights that are knotty. 

-The amount of data that is ingested all at once usually times out. We need to rethink our level 
or archiving. 

What ideas do you have for the future development of collecting web 
based content within the public library? What ways can the cohort 
support the larger public library community to extend the program and 
encourage other libraries to engage in this type of collecting? 


-Adding additional sites from larger institutions and finding lesser known community 
organizations that may only have Facebook accounts. 

-At some point, I will be looking for ways to put the content I have collected to use. I have a 
friend and colleague who is an anthropologist who does work on food systems and 
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sustainability. We have been talking for years about collecting menus from restaurants in town 
and then analyzing them for cultural, environmental, and culinary shifts. I originally started to 
collect physical menus from local restaurants, but this method became tedious, and our idea sat 
dormant. Now, with Archive-lt, I can collect the menus of local restaurants on a regular basis 
with little effort. Years from now, I plan to create a celebration of Westborough Foodways (we 
have lots of great restaurants, working farms, an agriculture heritage, and a shifting cultural 
community) and as a part of this program seek a grant to fund my colleague's analysis of how 
we eat food in our town. 

-The web archive needs to become the new vertical file, fully integrated into the regular practice 
of documenting local history. Now that a substantial number of public libraries have started 
archiving, there is an opportunity to reach out regionally and provide toolkits, advice and 
training. 

-I perceive that the collection of web based content will seamlessly become an integral piece of 
all collection development policies and that students in library and information sciences enter 
the working world armed with the knowledge to work within an electronic environment. 

The cohort can support the greater library community by sharing policies, collections, processes 
of creating public awareness, success stories and bumps in the road. It is important to introduce 
the concept and keep the conversation alive. 

-Based on our own organization's collections, we have created a pretty complete snapshot into 
our current community. That in and of itself is a good way to excite the public library community. 
They can collaborate with their community members and organizations to create this 
community-curated archive of important aspects of their town/city through websites that might 
otherwise go unnoticed or will disappear forever. 

-The cohort could create a toolkit based on our experiences that would help others to dive into 
web archiving. Ultimately, however, I think the biggest obstacle will be the same as it always is 
in public libraries: money. It's hard for me to imagine how I would support other libraries in this 
work when I'm not sure I can even continue it at my own institution. 

-I think a lot of Library staff don't fully understand what it is or why it is important. Like a lot of 
other things in Library Land - advocacy is needed. 

-I'd like to see Special Collections departments of public libraries spearhead the use of web 
archiving to collect digital content that reflects our current communities, particularly addressing 
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the gaps and silences in our historical collections. I have two ideas about ways that the program 
could be extended by the cohort: (1) develop an online "manual" for how to build a web archive 
aimed at public libraries that includes the CommWebs training modules alongside examples of 
how specific libraries have implemented the different steps in the process and (2) organize 
regional groups of web archivers that meet up, in rotating locations, to discuss ideas and 
challenges (meet-ups could occur at conferences as well. 

-I think it’s most effective to demonstrate what could be lost when a page has been captured 
and then changes. Pulling out good examples and compiling them into a video, TED talk, or 
some other medium aimed at a general audience (and I’m sure someone has already done 
this... probably lA... many times) seems like the most effective way to get others motivated to do 
web archiving. Without the social relevance piece, it’s just another project we should be doing 
amongst many other preservation and info access duties. 

-This year I plan to extend my scope to create collections for business groups, arts 
organizations and church groups. They all contribute to the vitality of the town. I would also like 
to create a collection dedicated to town events and celebrations. Though these events are most 
likely captured under the Town’s website crawl. I’d like to create a separate collection because I 
think it would easier for people to access it this way. This also is the case with the online 
newspaper. The major business groups are covered with those crawl, but again, I think it would 
be more appealing to have a separate collection. 

What made me want to learn web archiving was when I was trying to save documents on key 
events in town (new school, new town development) that did not present properly in print format. 
I realized that if I wanted to archive these key events, the best way would be to capture them in 
the medium that they were meant to be viewed. Librarians need to realize that the events of 
today are the history of tomorrow and that most of the events are being created digitally. Unlike 
a print document that I have on my shelf from 1998, the webpage with key information may not 
be around in 20 years when an individuals wants to know the background and the issues that 
lead to a decision that was made. 

-Use the same crowdsource model that other institutions use like Ivy Plus. For example the work 
of Queens, Brooklyn, and New York public libraries could really build a wonderful collective 
documenting the communities and everyday people of NYC. 

-I have no input at this point 

-The program does assume that there is already a lot of digital content available to be collected. 
Components on how to encourage communities to develop those web presences would be 
helpful. We’re especially aware that there’s a lot of “hidden” discussion that we either aren’t able 
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to locate or have no way to record. The newspapers might report the basics about a situation 
and a few public comments, but we don’t really see how it affects everyday life. Of course, not 
talking to each other about things is very SW Minnesotan, so those hidden discussions might 
not be happening at all. In that case, the library needs to step in to facilitate more of those 
discussions. 

Likewise, facilitation of discussion between libraries would be helpful. Even among grant 
participants, it’s hard to feel a sense of community. It would be helpful to have, for instance, an 
open forum where we could post comments and questions to other librarians that might not fall 
under one of the specific forum topics in the lesson plans. I suspect a lot of us have run into 
similar logistical challenges and frustrations, and we’ve probably slowed ourselves down by not 
sharing our ideas. 

-Ideally, funding is always difficult for public libraries, so offering opportunities to capture items 
and be able to organize them economically is important. It's also important to discuss the value 
of web archiving to other libraries and promote its role in collection development. 

-The user interface and training documents for archive-it are still pretty complex. Providing some 
basic training on how to get started or how to do web archiving in a really simple way would be 
helpful. Public library staff are usually doing a lot of different things and may not have the time 
or dedicated attention needed to really understand the complexities of web archiving. This is 
going to be particularly true in public libraries that don’t have physical archives. 

Very practically speaking, it would also help to publicize that pricing for public libraries may be 
lower than for universities (or that you’re willing to work with us). The only prices I ever heard 
were from large universities and I never inquired with Archive-lt directly because the prices I 
heard were too high. 

-Partner with each state library (Alabama Public Library Service is our state library) to get the 
word out to every public library in their state. Perhaps state libraries could be convinced to 
create state-wide collections. Many states already have memory projects for digitized materials 
and could be encouraged to add web based content. 

Provide an alternative with written instructions for small public libraries to share local websites 
without having to subscribe to Archive-lt. Many have little to no technical expertise but they do 
know what is important in their community. 

-I think we need to share our experiences very locally, not just to the echo chamber of the 
professional archival community. We need to educate our own colleagues in our immediate 
libraries, as well as our customers about using the “historical” internet as a resource. I think I 
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would benefit from spending more time using the Wayback as a resource (and not just crawling 
web sites) to relate to other library customers better. 

I plan to write a newsletter item for the Society of Rocky Mountain Archivists this spring. But 
given the range of libraries across Colorado and region, I don’t understand very well how best to 
encourage smaller libraries to collect the web. Should they suggest sites for Archive-lt to crawl? 
While it would be great for them to learn to archive the web themselves, is it financially 
attainable for their library to become a paid partner of Archive-lt? 

With time, web archiving will become an inclusive part of how we think about our collections at 
DPL, but this won’t happen overnight. 

-I'm looking forward to a point when more staff have an understanding about web archiving and 
come to us with their ideas. I can make guesses at what might be relevant, but the staff who are 
working with communities on a regular basis will likely have better ideas! 

The best way for me to explain web archiving to other people at my library is highlighting 
specific collections. The blog posts from libraries participating in the grant have been really 
helpful as well. I think having some type of communication channel would be great. I noticed 
that this grant cohort doesn't seem to use the ones we set up, but I bet libraries that are 
considering this would love to have a place to go and ask questions. 

-If Archive-lt were less fiddly, it would make the learning curve for the time-strapped web 
archivist less steep. For instance, the suggested settings for some of the known web archiving 
challenges like Vimeo or Instagram could be applied as the default across the board instead of 
just in some cases. Or, making it easier to differentiate between the Wayback link that leads to 
your test crawl results from the Wayback link that doesn't lead there. Or, creating a tool that 
allows libraries to create curated collections of resources that have already been crawled and 
captured by the Wayback Machine. Really, anything that lowers the bar related to difficulty, time 
intensiveness, and cost. 

-Get the word out. Let's write an article for the American Archivist and /or Archival Issues, or 
other. Offer sessions, workshops, or brown bags and both national but also regional 
conferences/meetings. Have a site on the lA Community Webs blog or website, where each 
cohort member summarizes their project, lesson learned, recommendations, and allow other 
public libraries to contact them directly. 

-I hope to continue to collect web content for future use as well as extending the program at 
conferences and meetings. 
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-We should probably be doing web-archiving managed at a mix of state and consortial levels. 
Some of the administrative and technical functions might be offloaded onto a state library. 

-The collections fits in nicely with a pilot program we want to start next year to collect materials 
for communities where we have very little history archived. It would be ideal to harvest new web 
content that correlates to new parts of town that we begin to archive. 


What would your recommendations be for a public library embarking 
on a new web archiving program? 

-Look at the web presence of local organizations first to see how many there are. Also think 
about how the patrons might find a saved version of a website valuable. 

-I am afraid that Archive-lt has a steep learning curve, and I just don't know how to get around 
that. I have spent many hours working with and learning about the tool. It will be difficult to 
convince librarians/archivists who already feel overworked to put in the time to learn how to use 
Archive-lt. That being said, I am convinced that if we as archivists want to accurately capture 
what our communities are all about today, Archive-lt (or some similar tool) is crucial. If we are 
not collecting web and social media sites today, then we are doing a disservice to those in the 
future who will want to research our lives and times. 

-It should be funded just like any other collecting activity, with a budget for data, a staff point 
person and a promotional and access strategy. 

-Creating awareness of web archiving and the necessity to preserve that which might be lost or 
otherwise unable to easily be shared. To be certain that the collection has relevance to the 
community or institution it will represent. The need for one or more staff to have the technical 
knowledge necessary to upload info, conduct crawls and create meta data. 

-Make sure you have a good team before starting the program. It's important to have more than 
one person to work on actively archiving websites and creating metadata. Creating a landing 
page for your web archive is also very important. You can explain more of what the program is, 
highlight collections, create FAQs, and garner more interest if this is completed before you 
showcase your web archive to the public! 
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-Have a dedicated staff member or at least a part-timer. I have found the process to be more 
time-consuming and harder to fit in alongside my many other duties than I anticipated. 

-Be flexible. You may find that your initial plan doesn't work out exactly how you want it to. 

-Work hard to create a group of advocates for web archiving including the library director, head 
of digital services/collections, and representatives from public services, IT and marketing. It 
requires a range of stakeholders to educate and promote web archives. The more these 
advocates can raise awareness of the need for web archives and explain what they are and 
how they work, the better this new service will be understood, assimilated, and utilized. I think 
an excellent way to recruit advocates is to show them the web archives of major institutions like 
LOC and universities, examples of thematic web archives, and most importantly the web 
archives of other public libraries. You can especially highlight the potential of web archiving to 
connect the public library with small, local, diverse community groups that might not otherwise 
partner with the library. 

-To engage with stakeholders to determine which websites they want to capture. 

-Create a collection development policy for you library if you don’t have one. That will help you 
focus on what you want to collect digitally as well as in print. 

If at all possible, have more than 1 person involved and committed to the project. 

Learn about metadata and how to use it effectively. 

Revisit whatever decisions you make on an annual basis. 

The importance of having commitment from the administration of the library for this project. 

-Create archives that supplement the collections you have. As your abilities and expertise grow 
embark on building new web archive collections that expand your collections and expand 
possibilities for library and community engagement. Also be intentional and provide the 
necessary time to successfully carry out the work of web archiving, because it is a full time job. 

-I would recommend the organization establishing a team that work on web archiving. One 
person can certainly be in charge of the entire web archiving program but I think a team of two 
or more would be so much better in terms of sharing responsibilities but also that can support 
each other. The first year of this project was challenging for me because I was the only person 
doing web collecting in addition to my other duties. I recommend a two person team would be 
so much better. 
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-Make sure that you are familiar with your community’s web presence and comfort level with 
digital media. If there isn’t already a robust presence, have a program already in place to help 
expand that presence. We have lots of great plans, but it doesn’t seem that they’re going to 
come to fruition during the grant period for a variety of reasons. 

Plan for the hurry-up-and-wait workflow. I’m still working out how to best schedule time to work 
on our collections. Since it’s difficult to know how much work you’ll be able to do at any given 
point, depending on crawl results and crawl times, it’s a bit of a challenge. 

Based on our experiences and the comments of other libraries at the previous in-person 
meeting, we’d recommend for archiving in rural areas to be focused on regional hubs, but not 
necessarily on very small communities. In our library system, which serves rural SW Minnesota, 
the majority of libraries employ only part-time staff. Even the library directors are part-time staff. 
It’s not likely that those small community libraries would be able to find or justify the amount of 
time that would be needed to be allocated to web archiving. Also, if the digital footprint of those 
communities is as small as those of the communities around us, the time spent learning the 
tools necessary to archive wouldn’t be worthwhile. 

-Try to do as much research and organizational work before undertaking. It's important to have 
a clear collection development policy, specifically for web archiving and get local support and 
interest before collecting anything to help identify the best sites/pages to capture. 

-Have a defined sense of what it is that you want to record and save. Don’t be afraid to make 
mistakes - just get started! 

-Make sure there is a staff person who has technical experience. 

-Dabbling is a great way to learn, but try to narrow a collecting goal very early on. For instance - 
events in my town, news websites that may be temporal, serials that are suddenly only found 
online, politics during an election cycle. Choose a collecting focus (or several) for which you can 
recruit a team internally. This team can learn how to crawl sites, assess results, and can 
generate a scope of collecting that’s reasonable for your library. There’s a lot of checking in with 
your Archive-lt site to review crawls, test scoping rules, so I think having more than one person 
to follow through with those steps might be more efficient than just one person focusing on the 
project. 

-Train multiple people! And think through your collections ahead of time. 
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-Be prepared to invest a considerable period of time in the community outreach component—it's 
a very word-of-mouth type process, and it takes time for word to spread. Make sure someone on 
staff has the time to devote to learning the tools. 

-start small 
-Plan, plan, plan! 

-Match your web collection to your analog collecting. 

Take the opportunity to collect prospectively. 

-Narrow the number of websites you archive, but make them broad in terms of representation, 
and deep when it comes to archiving actual web content. 


What topics do you think would be nnost useful to cover in the white 
paper? 

-Programming and publicity tactics; How a saved website can be a valuable resource 

-1) The challenge of getting a website and social media collecting program off the ground; 2) 
how doing so should fit into a comprehensive strategy for rethinking how we practice local 
history today by making it more inclusive, representative, and relevant; 3) creative ideas for how 
these web and social media collections can and should be used in the future, so that public 
librarians/archivists will see that if they don't sign on to this technology today, they will be 
woefully behind in fulfilling the needs of future local history researchers. 

-Common technical barriers to archiving sites such as proprietary software, crawl traps and 
problematic plugins used widely on popular platforms. 

-Sustainability. Training. Ideas for community outreach and engagement. 

-potential privacy concerns and challenges / copyright issues specific to web content / outreach 
and programming examples / sample data budgets and accompanying collection development 
policies - what this looks like for differently sized organizations / potential (research/other) uses 
for web archives / examples of collective web archiving efforts like CoWeb / ways to automate or 
produce "good enough" metadata 
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-The emphasis should be on how public libraries intuitively had a different approach to web 
archiving than other types of institutions that had approached the task. The community-based 
aspect is really an important thing to explore. 

-Why web archiving is important. 

The importance of training and how long it takes to become familiar with the crawling 
technology. 

Emphazing how critical it is to have a web collection development policy. Many public libraries 
don’t have a local history collection, local history room or trained archivist. One individual may 
be the person who is responsible for local history but that person also probably works the 
reference desk, does programming and/or administration. Not only is time an issue, but 
choosing what to collect can be overwhelming without a document in place that can help that 
individual focus. 

-How current events catalyze communities to quickly publish on the web and how this highlights 
the importance and urgency of public libraries to build web archives. Public memory is a fickle 
thing in that we are outraged but quickly forget. The internet can be a fickle thing too where 
content is created is quickly and just as soon forgotten. 

-Community engagement (collection input), value of web collecting, buy in from administration, 
incorporating web archiving in collection development policy, funding for this type of project 

-We suggest including a section on “failing.” As a community that discovered we had 
considerably less to contribute than others, we feel as though we’ve fallen short from the start, 
but we also believe that this experience will help us. Because our digital footprint is so small 
now, it’s also manageable for a smaller library. We have no doubt that our digital footprint will 
grow considerably in the next few years, and we now have tools and background knowledge to 
help us avoid being overwhelmed by it. 

We also suggest a comparative section to note the state of affairs for digital content in different 
parts of the country and different communities. 

-Collection policy, how to use it going forward 
-Basic training manual for public library staff 

Copyright / licensing - what responsibilities do we have in terms of securing rights to material, so 
that researchers can use it? 

Organization and access of web archives - this is still a new field, and I don’t have a good sense 
of the best way to organize web archives for researchers. 

Highlight different types of content that public or community libraries could archive 
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-How best to capture social media sites 

How to capture websites created using popular website builders 

-Promoting web archives from small everyday interactions to larger efforts at community 
outreach. Advocating internally to keep web archiving going. 

-I'm interested in hearing more about successful programs that engaged communities directly. 
As well as how people promoted their collection to the community. 

-Effective strategies for soliciting community input, or understanding of the value of web 
archiving. Streamlining the process of testing and refining crawl settings. 

-collection development 

-Developing a project charter and scope before diving into web archiving immediately. As well 
as how to communicate with community partners the importance of web archiving on a local 
level. 

-Collection development dovetailing with analog and digital collections, human resource costs, 
OPAC integration, administrative overhead, other investments in time and money. 

-Using web archiving as a spring board for building bridges in communities and creating deeper 
and longer lasting relationships through partnerships and building relationships with 
communities by trading information and materials. 
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Appendix 6: Year 2 Meeting Agenda 



COMMUNITY 
IWEBS 



Year 2 Meeting Agenda 11/2/2018 


9:30-10:00 -- Opening remarks 
10:00-11:00 -- Breakout Session 1 
11:00-11:30 - Coffee Break 
11:30-12:30 -- Breakout Session 2 
12:30-14:00 - Lunch 
14:00-14:30 - Cohort Talks 
14:30-15:30 -- Breakout Session 3 
15:30-16:00 -- Coffee Break 
16:00-17:00 -- Breakout Session 4 
17:00-17:30 -- Summarizing 

Please refer to the Breakout Sessions and Groups 
handout for information on which group you are in. 


Don’t forget to sign up and attend Archive-lt office hours! 
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Breakout Sessions and Groups 


Breakout Session 1 

10 : 00 - 11:00 


Breakout Session 2 

11:30-12:30 


Sustaining Programs 

Athens Regional Library System 

Birmingham Public Library 

Brooklyn Public Library 

Buffalo & Erie County Public 
Library 

Cleveland Public Library 

Columbus Metropolitan Library 

County of Los Angeles Public 
Library 

Denver Public Library 

East Baton Rouge Parish Library 

Forbes Library 

Grand Rapids Public Library 

Henderson District Public Libraries 

Kansas City Public Library 


Technology and Tools 

Lawrence Public Library 
Marshall Lyon County Library 
New Brunswick Free Public Library 

Patagonia Library 

Pollard Memorial Library 
Queens Library 

San Diego Public Library 

San Francisco Public Library 
Schomburg Center 
Sonoma County Library 
The Urbana Free Library 
West Hartford Public Library 
Westborough Public Library 


Sustaining Programs 

Lawrence Public Library 
Marshall Lyon County Library 
New Brunswick Free Public Library 

Patagonia Library 

Pollard Memorial Library 
Queens Library 

San Diego Public Library 

San Francisco Public Library 
Schomburg Center 
Sonoma County Public Library 
The Urbana Free Library 
West Hartford Public Library 
Westborough Public Library 


Technology and Tools 

Athens Regional Library System 

Birmingham Public Library 

Brooklyn Public Library 

Buffalo & Erie County Public 
Library 

Cleveland Public Library 

Columbus Metropolitan Library 

County of Los Angeles Public 
Library 

Denver Public Library 

East Baton Rouge Parish Library 

Forbes Library 

Grand Rapids Public Library 

Henderson District Public Libraries 

Kansas City Public Library 
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Breakout Sessions and Groups 


Breakout Session 3 

2:30-3:30 


Breakout Session 4 

4:00 - 5:00 


Collection Development and Policy 

Lawrence Public Library 
Marshall Lyon County Library 
New Brunswick Free Public Library 
Patagonia Library 
Pollard Memorial Library 
Queens Library 
San Diego Public Library 
Denver Public Library 
East Baton Rouge Parish Library 

Forbes Library 

Grand Rapids Public Library 
Henderson District Public Libraries 

Kansas City Public Library 


Programming and Outreach 

San Francisco Public Library 

Schomburg Center 

Sonoma County Public Library 

The Urbana Free Library 

West Hartford Public Library 

Westborough Public Library 

Athens Regional Library System 

Birmingham Public Library 

Brooklyn Public Library 

Buffalo & Erie County Public 
Library 

Cleveland Public Library 

Columbus Metropolitan Library 

County of Los Angeles Public 
Library 


Collection Development and Policy 

San Francisco Public Library 
Schomburg Center 

Sonoma County Public Library 

The Urbana Free Library 
West Hartford Public Library 
Westborough Public Library 
Athens Regional Library System 
Birmingham Public Library 
Brooklyn Public Library 
Buffalo & Erie County Public Library 
Cleveland Public Library 
Columbus Metropolitan Library 
County of Los Angeles Public Library 


Programming and Outreach 

Lawrence Public Library 
Marshall Lyon County Library 

New Brunswick Free Public Library 

Patagonia Library 

Pollard Memorial Library 

Queens Library 

San Diego Public Library 

Denver Public Library 

East Baton Rouge Parish Library 

Forbes Library 

Grand Rapids Public Library 

Henderson District Public Libraries 

Kansas City Public Library 
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Community Webs Project Feedback Survey 

Responses 

Please rate your proficiency with web archiving prior to participating in the 
Community Webs program 

16 responses 
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Please rate your proficiency with web archiving after having participated in 
the Community Webs program 

16 responses 
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Do you expect to continue web archiving at your institution after the 
Community Webs program ends? 

16 responses 



# Yes 

• No 

ft I'm not sure 


What roadblocks, if any, do you foresee for the web archiving program at 
your institution? 

16 responses 


Lack of institutional buy-in 


Lack of funding 


Lack of staff time 


Lack of community interest 
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How helpful did you find the resources in the Community Webs online 
course? 

16 responses 


8 

6 

4 

2 

0 ( 0 %) 

0 - 

1 


0 ( 0 %) 
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Are there additional resources or areas of focus that could have 
improved the curriculum? 


-site visits from trainers to work with staff and meet with leadership to explain web archiving and its 
potential uses 

-Patch Crawling -1 did not understand how to patch crawl until very recently since it is only accessible 
after the crawl has been saved. 

-More help with how to troubleshoot test crawls that don't return expected results. 

-An official user manual with FAQs and trouble shooting 

-I would have like to have been able to have more sessions where we could have called in or discussed 
how to resolve problems that arise as you're learning how to web archive on a regular basis. Perhaps a 
bi-weekly hour online meeting where you could ask the question, get a response and then follow up 
without having to go back and forth with emails and the time it takes. 

-More detailed QA instruction. 

-I think the curriculum covered most areas 
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-I think more resources on how to get the community to be more involved in selecting subjects to collect 
as well as seeds. I think in general more strategies and techniques for getting more community 
engagement. 

-They were overall very helpful. I had to play a few of them a couple times to understand fully what was 
being conveyed, but I came out of the sessions pretty confident with what I was doing. There are a lot of 
moving parts, and it is hard to keep them all together. 

-Direct hands-on training time for those with no or limited experience 

-I think a bit more time spend teaching how to troubleshoot web crawls, patch crawling and metadata 
creation would have been good. 

-Helping to create partnerships and/or community groups to 
-The Archive-lt API. 

-a page dedicated to professional development opportunities in web archiving 

-More interactions with grant participants, more opportunities to ask peers questions. 

-The curriculum was great. My one wish is that we'd formed a stronger community online as a cohort. We 
probably needed to be encouraged/forced to post more online or in slack.:) 


Please rate your overall satisfaction with the Community Webs program 

16 responsesi 

10.0 


9 (56.3%) 


7.5 

5.0 

2.5 

0 (0%) 0 (0%) 0 (0%) 

00 - 
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7 (43.8%) 
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Please provide brief feedback about your experience participating in 
the Comnnunity Webs progrann. 

-always nice to be part of a cohort who learn together 

-I had so much fun learning this new technology! The program has brought me to places I never thought I 
would get to see (San Francisco and Ohio, among other places that I went to conferences at)...The 
comradery between the cohort members was fantastic and getting to meet with each other at the annual 
meeting really made us feel like we weren't alone and that we were in fact doing a very good job. 

-The Community Webs program not only expanded my knowledge of archiving web information but also 
brought the library closer and more connected with several local groups and organizations to partner 
with. The only frustration was with the Archive-lt program and trying to successfully capture the websites 
we wanted to archive. 

- It was great, I would like to devote more time to it in the coming months and years. 

-I am so glad that i was able to participate, but it was very frustrating at times. I was the only person at my 
library doing web archiving and it seemed like I work on it for concentrated times and then when I'd come 
back I'd have to re-familiarize myself with the process. Good web design means the pages are all similar 
which makes it harder to differentiate at what page your supposed to be on to do a certain function. Also, 
not being a cataloger, I was trying to teach my self how to add metadata to records. I learned a great deal 
and continue to archive information for my community. 

-This project has been eye-opening and very instructive for my department and our institution and I 
personally have also learned a lot. I wish we could have done more and I feel somewhat guilty that I didn't 
gain more proficiency in web archiving, but we all have to start somewhere. Even though we currently 
don't have the buy-in or funding to continue, I'm hoping this project can serve as an example for ongoing 
discussions with our IT department and administration about digital records in our collections. I also 
really appreciated the community of practitioners that we were able to connect with during the course of 
the project. 

-I loved this program. I was so unfamiliar with web archiving, and never thought a public library would 
have any involvement in something like this. I'm really excited that so many public libraries were able to 
dip their toes in the web archiving waters. I also loved that it brought awareness to what public libraries 
are capable of doing! 

-The program has been wonderful and I have learned a lot about web archiving. I think more virtual cohort 
meetings would be helpful in learning about what other libraries are doing but also to provide support for 
each other in the collecting process. 
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-I am so grateful for the opportunity to participate in this program. It not only gave me new skills, but it 
challenged me to rethink my entire approach to local history, and I am now working on instituting these 
changes. We had a nice mix of public libraries participating. I think the experience of the larger 
institutions were different from the smaller ones, so maybe grouping the similar institutions together at 
times would have been helpful. I hope you continue to solicit the participation of public libraries in the 
Archive-lt program. It is really important that this work continues to happen! 

-Cohort meetings were insightful, I would have welcomed additional in-person meeting time. Web 
archiving is as relevant to small institutions as large institutions. As I stated in the past, for a small library 
with limited technical exposure and limited access to hands-on training made this adventure challenging. 

I am now aware of the ease of policy implementation and community input on collection development for 
a small institution versus a larger institution. 

Moving forward will be a challenge unless further grant funds are procured. 

-I really enjoyed the program, the community of participants, and I really believe in the case for web 
archiving for public libraries. My experience was overall, terrific. I feel as though I might have benefited 
from assembling a small team of participants at my institution to at a minimum follow the online 
coursework for the grant. It was harder for me to sustain momentum on the project with the other job 
duties that come with my public library role. Some of the web archiving processes got much easier after 
much practice. Yet the coursework was all concentrated into the first 5 months or so of the grant. Maybe 
giving participants more time to get comfortable with the software and the process would allow for more 
productive learning? For example, we talked about metadata creation and community partnerships before 
I really felt ready for those steps in my own process. 

-I enjoyed the program and the process. I feel like this is a great way to capture digital born content and 
ensure that the resources that we are using today are available for future generations down the road. I 
think that we need to update people on the fact that their content should be maintained and preserved to 
not only keep their own information in a timeline, but also maintain longevity of information for future 
anniversaries, celebrations and the like. I enjoyed the learning atmosphere and the fact that we couldn't 
do it wrong. It helped us to organize our thoughts as we figured out the process. I appreciate the help and 
thoroughness of the staff and the presenters. 

-The biggest strength of the program was in building a community of public librarians with web archiving 
experience that are able to support one another going forward. 

-I met some amazing folks in the cohort who have linked me to great resources and project ideas. I am 
equipped and inspired to connect more public library folks interested in web archiving with the 
Community Webs curriculum and share what I've learned. 
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-This was a very positive experience! We learned a lot about the opportunities and challenges of web 
archiving. The presentations and blog posts form other participants were engaging and help spark new 
ideas. We are looking forward to seeing how our collections develop over the next few years. 

This was an absolutely fantastic experience. I'm thrilled that we are now web archiving. What I was not 
anticipating was the value of the cohort, and of the access to Archive-lt staff. I deeply appreciate how you 
valued and spent time with us. It says a lot to me that you operate at a national/international level but 
chose to invest so much in a program with public librarians. The experience encouraged me to develop 
my thinking around archiving in general, not just web archiving (although of course I learned plenty there 
tool). 
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